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Introduction 

Recurrent - wandering, conservative - dissipative, contracting - expanding, deter- 
ministic - chaotic, isometric - mixing, periodic - turbulent, distal - proximal, the list 
can go on and on. These (pairs of) words — all of which can be found in the dictio- 
nary — convey dynamical images and were therefore adopted by mathematicians to 
denote one or another mathematical aspect of a dynamical system. 

The two sister branches of the theory of dynamical systems called ergodic theory (or 
measurable dynamics) and topological dynamics use these words to describe different 
but parallel notions in their respective theories and the surprising fact is that many of 
the corresponding results are rather similar. In the following article we have tried to 
demonstrate both the parallelism and the discord between ergodic theory and topo- 
logical dynamics. We hope that the subjects we chose to deal with will successfully 
demonstrate this duahty. 

The table of contents gives a detailed listing of the topics covered. In the first 
part we have detailed the strong analogies between ergodic theory and topological 
dynamics as shown in the treatment of recurrence phenomena, equicontinuity and 
weak mixing, distality and entropy. In the case of distality the topological version 
came first and the theory of measurable distality was strongly influenced by the topo- 
logical results. For entropy theory the influence clearly was in the opposite direction. 
The prototypical result of the second part is the statement that any abstract mea- 
sure probability preserving system can be represented as a continuous transformation 
of a compact space, and thus in some sense ergodic theory embeds into topological 
dynamics. 

We have not attempted in any way to be either systematic or comprehensive. 
Rather our choice of subjects was motivated by taste, interest and knowledge and to 
great extent is random. We did try to make the survey accessible to non-specialists, 
and for this reason we deal throughout with the simplest case of actions of Z. Most 
of the discussion carries over to noninvertible mappings and to M actions. Indeed 
much of what we describe can be carried over to general amenable groups. Similarly, 
we have for the most part given rather complete definitions. Nonetheless, we did 
take advantage of the fact that this article is part of a handbook and for some of the 
definitions, basic notions and well known results we refer the reader to the earlier 
introductory chapters of volume I. Finally, we should acknowledge the fact that we 
made use of parts of our previous expositions [Hn| and 35] . 

We made the writing of this survey more pleasurable for us by the introduction 
of a few original results. In particular the following results are entirely or partially 
new. Theorem 11.21 (the equivalence of the existence of a Borel cross-section with 
the coincidence of recurrence and periodicity), most of the material in Section 4 
(on topological mild-mixing), all of subsection 7.4 (the converse side of the local 
variational principle) and subsection 7.6 (on topological determinism). 
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Part 1. Analogies 



1. POINCARE RECURRENCE VS. BIRKHOFF'S RECURRENCE 



1.1. Poincare recurrence theorem and topological recurrence. The simplest 
dynamical systems are the periodic ones. In the absence of periodicity the crudest 
approximation to this is approximate periodicity where instead of some iterate T"a; 
returning exactly to x it returns to a neighborhood of x. The first theorem in abstract 
measure dynamics is Poincare's recurrence theorem which asserts that for a finite 
measure preserving system (X, T) and any measurable set A, /x-a.e. point of A 
returns to A (see |461 Theorem 4.3.1]). The proof of this basic fact is rather simple 
and depends on identifying the set of points W G A that never return to A. These 
are called the wandering points and their measurability follows from the formula 



Now for n > 0, the sets T~^W are pairwise disjoint since x G T^"W means that the 
forward orbit of x visits A for the last time at moment n. Since n{T~^W) = fi{W) 
it follows that /i(W^) = which is the assertion of Poincare's theorem. Noting that 
A n T~^W describes the points of A which visit A for the last time at moment n, 
and that /i(U^QT~"iy) = we have established the following stronger formulation 
of Poincare's theorem. 

1.1. Theorem. For a finite measure preserving system {X, B, /i, T) and any measur- 
able set A, fi-a.e. point of A returns to A infinitely often. 

Note that only sets of the form T^'^B appeared in the above discussion so that the 
invertibility of T is not needed for this result. In the situation of classical dynam- 
ics, which was Poincare's main interest, X is also equipped with a separable metric 
topology. In such a situation we can apply the theorem to a refining sequence of 
partitions 7^, where each T^n is a countable partition into sets of diameter at most 
^. Applying the theorem to a fixed we see that /i-a.e. point comes to within ^ of 
itself, and since the intersection of a sequence of sets of full measure has full measure, 
we deduce the corollary that /i-a.e. point of X is recurrent. 

This is the measure theoretical path to the recurrence phenomenon which depends 
on the presence of a finite invariant measure. The necessity of such measure is clear 
from considering translation by one on the integers. The system is dissipative, in 
the sense that no recurrence takes place even though there is an infinite invariant 
measure. 



There is also a topological path to recurrence which was developed in an abstract 
setting by G. D. Birkhoff. Here the above example is eliminated by requiring that the 
topological space X, on which our continuous transformation T acts, be compact. It 
is possible to show that in this setting a finite T-invariant measure always exists, and 
so we can retrieve the measure theoretical picture, but a purely topological discussion 
will give us better insight. 
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A key notion here is that of minimahty. A nonempty closed, T-invariant set E G X, 
is said to be minimal if F (Z E, closed and T-invariant implies F = ^ oi F = E. If 
X itself is a minimal set we say that the system (X, T) is a minimal system. 

Fix now a point Xq E X and consider 

oo 

uj{xq) = Pi {T'^Xo : k>n}. 

n=l 

The points of ui{xq) are called cj-limit points of xq, {u = last letter of the Greek 
alphabet) and in the separable case y G uj{xo) if and only if there is some sequence 
/Cj — > oo such that T^'Xq y. If Xq € uj{xq) then Xq is called a positively recurrent 
point. 

Clearly uj{xo) is a closed and T-invariant set. Therefore, in any nonempty minimal 
set E, any point xo E E satisfies xq G w{xo) and thus we see that minimal sets have 
recurrent points. 

In order to see that compact systems {X, T) have recurrent points it remains to 
show that minimal sets always exist. This is an immediate consequence of Zorn's 
lemma applied to the family of nonempty closed T-invariant subsets of X. A slightly 
more constructive proof can be given when X is a compact and separable metric space. 
One can then list a sequence of open sets Ui, U2, ■ ■ ■ which generate the topology, and 
perform the following algorithm: 

1. set Xo = X, 

2. for i = 1, 2, ... , 

if Ur=-oo T-^'U, D X,_i put X, = X,_i, else put X, = X,_i \ [j^.^ T-^U,. 
Note that Xj 7^ and closed and thus Xqo = fli^o"^* nonempty. It is clearly 
T-invariant and for any Ui, if Ui fl Xqo 7^ then IJ^^oo ^ "(f^i l~l -^00) = -^00, which 
shows that (Xoo,T) is minimal. 

1.2. The existence of Borel cross-sections. There is a deep connection between 
recurrent points in the topological context and ergodic theory. To see this we must 
consider quasi-invariant measures. For these matters it is better to enlarge the scope 
and deal with continuous actions of Z, generated by T, on a complete separable metric 
space X. A probability measure fi defined on the Borel subsets of X is said to be 
quasi-invariant if T ■ /x ~ /i. Define such a system (X, !B, /x, T) to be conservative 
if for any measurable set A, TA C A implies n{A \ TA) = 0. 

It is not hard to see that the conclusion of Poincare's recurrence theorem holds for 
such systems; i.e. if fi{A) > 0, then fi-a.e. x returns to A infinitely often. Thus once 
again /i-a.e. point is topologically recurrent. It turns out now that the existence of a 
single topologically recurrent point implies the existence of a non-atomic conservative 
quasi-invariant measure. A simple proof of this fact can be found in jSH] for the case 
when X is compact — but the proof given there is equally valid for complete separable 
metric spaces. In this sense the phenomenon of Poincare recurrence and topological 
recurrence are "equivalent" with each implying the other. 

A Borel set -B C X such that each orbit intersects B in exactly one point is called 
a Borel cross-section for the system (X, T) . If a Borel cross-section exists, then 
no non-atomic conservative quasi-invariant measure can exist. In |H2] it is shown 
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that the converse is also valid — namely if there are no conservative quasi-invariant 
measures then there is a Borel cross-section. 

Note that the periodic points of (X, T) form a Borel subset for which a cross- 
section always exists, so that we can conclude from the above discussion the following 
statement in which no explicit mention is made of measures. 

1.2. Theorem. For a system {X, T) , with X a completely metrizable separable space, 
there exists a Borel cross-section if and only if the only recurrent points are the peri- 
odic ones. 

1.3. Remark. Already in jl21 as well as in [21] one finds many equivalent conditions 
for the existence of a Borel section for a system (X, T) . However one doesn't find 
there explicit mention of conditions in terms of recurrence. Silvestrov and Tomiyama 
[7H] established the theorem in this formulation for X compact (using C*-algebra 
methods). We thank A. Lazar for drawing our attention to their paper. 

1.3. Recurrence sequences and Poincare sequences. We will conclude this sec- 
tion with a discussion of recurrence sequences and Poincare sequences. First for some 
definitions. Let us say that D is a recurrence set if for any dynamical system [Y, T) 
with compatible metric p and any e > there is a point and a G -D with 

piT^, yo) < e. 

Since any system contains minimal sets it suffices to restrict attention here to minimal 
systems. For minimal systems the set of such y^s for a fixed e is a dense open set. 

To see this fact, let U be an open set. By the minimality there is some N such that 
for any y eY, and some < n < X, we have T"?/ G U. Using the uniform continuity 
of T"', we find now a 5 > such that if p{u, v) < 6 then for all < n < X 

p(T"u, n) < e. 

Now let zq be a point in Y and do E D such that 

(1) p(T"°^o, ^o) < S. 

For some < < X we have T"'°zq = y^ E U and from ((T)) we get p{T'^°yo, yo) < e. 
Thus points that e return form an open dense set. Intersecting over e ^ gives a 
dense Gs in Y of points y for which 

inf p{T''y, y) = 0. 

Thus there are points which actually recur along times drawn from the given recur- 
rence set. 

A nice example of a recurrence set is the set of squares. To see this it is easier to 
prove a stronger property which is the analogue in ergodic theory of recurrence sets. 

1.4. Definition. A sequence {sj} is said to be a Poincare sequence if for any finite 
measure preserving system (X, S, p, T) and any B E "B with positive measure we 
have 

p{T^^B n B) > for some Sj in the sequence. 
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Since any minimal topological system {Y, T) has finite invariant measures with 
global support, /i any Poincare sequence is recurrence sequence. Indeed for any 
presumptive constant 6 > which would witness the non-recurrence of {sj} for [Y, T), 
there would have to be an open set B with diameter less than b and having positive 
/i- measure such that T^^^B n B is empty for all {sj}. 

Here is a sufficient condition for a sequence to be a Poincare sequence: 



1.5. Lemma. If for every a G (0, 27r) 



lim 



n 

- E 



k=l 



then {sk}'^ is a Poincare sequence. 



Proof. Let {X, "B, fj,, T) be a measure preserving system and let U be the unitary 
operator defined on L'^{X, S, /x) by the action of T, i.e. 

{Uf){x) = f{Tx). 

Let Hq denote the subspace of invariant functions and for a set of positive measure B, 
let /o be the projection of 1^ on the invariant functions. Since this can also be seen 
as a conditional expectation with respect to the cr-algebra of invariant sets /o > 
and is not zero. Now since 1b — /o is orthogonal to the space of invariant functions 
its spectral measure with respect to U doesn't have any atoms at {0}. Thus from the 
spectral representation we deduce that in L^-norm 



n 

E 

1 



U 



or 



-y w 



Lb 



Lb 



/o) 







L2 



L2 



and integrating against 1^ and using the fact that /o is the projection of 1^ we see 
that 

n 



lim 

n^oo n 



- E 



n T^'^'B 



ii/oir>o 



which clearly implies that {sk} is a Poincare sequence. 



□ 



The proof we have just given is in fact von-Neumann's original proof for the mean 
ergodic theorem. He used the fact that N satisfies the assumptions of the proposition, 
which is Weyl's famous theorem on the equidistribution of {no}. Returning to the 
squares Weyl also showed that {n^a} is equidistributed for all irrational a. For 
rational a the exponential sum in the lemma needn't vanish , however the recurrence 
along squares for the rational part of the spectrum is easily verified directly so that we 
can conclude that indeed the squares are a Poincare sequence and hence a recurrence 
sequence. 

The converse is not always true, i.e. there are recurrence sequences that are not 
Poincare sequences. This was first shown by I. Kriz [^0] in a beautiful example (see 
also |8B1 Chapter 5]). Finally here is a simple problem. 
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Problem: If D is a recurrence sequence for all circle rotations is it a recurrence 

set? 

A little bit of evidence for a positive answer to that problem comes from looking 
at a slightly different characterization of recurrence sets. Let 74 denote the collection 
of sets of the form 

N{U, U) = {n: T'^'U n f/ ^ 0}, {U open and nonempty), 

where T is a minimal transformation. Denote by the subsets of N that have a non- 
empty intersection with every element of Ji. Then is exactly the class of recurrence 
sets. For minimal transformations, another description of N[U, U) is obtained by 
fixing some ?/o and denoting 

7V(?/o, U) = {n: T'^y, G U} 

Then N{U, U) = N{yo, U) —N{yo, U). Notice that the minimality of T implies that 
N{yQ, U) is a syndetic set (a set with bounded gaps) and so any N{U, U) is the 
set of differences of a syndetic set. Thus !N consists essentially of all sets of the form 
S — S where S* is a syndetic set. 

Given a finite set of real numbers {Ai, A2, . . . , Xk} and e > set 

l^(Ai, A2, . . . , Afc; e) = {n G Z : max{||?T,Aj|| < e}}, 

j 

where || ■ || denotes the distance to the closest integer. The collection of such sets forms 
a basis of neighborhoods at zero for a topology on Z which makes it a topological 
group. This topology is called the Bohr topology. (The corresponding uniform 
structure is totally bounded and the completion of Z with respect to it is a compact 
topological group called the Bohr compactification of Z.) 

Veech proved in [TS] that any set of the form S — S with S G 7^ syndetic contains 
a neighborhood of zero in the Bohr topology up to a set of zero density. It is not 
known if in that statement the zero density set can be omitted. If it could then a 
positive answer to the above problem would follow (see also P^). 

2. The equivalence of weak mixing and continuous spectrum 

In order to analyze the structure of a dynamical system X there are, a priori, two 
possible approaches. In the first approach one considers the collection of subsystems 
Y G X (i.e. closed T-invariant subsets) and tries to understand how X is built up by 
these subsystems. In the other approach one is interested in the collection of factors 
X ^ Y of the system X. In the measure theoretical case the first approach leads 
to the ergodic decomposition and thereby to the study of the "indecomposable" or 
ergodic components of the system. In the topological setup there is, unfortunately, no 
such convenient decomposition describing the system in terms of its indecomposable 
parts and one has to use some less satisfactory substitutes. Natural candidates for in- 
decomposable components of a topological dynamical system are the "orbit closures" 
(i.e. the topologically transitive subsystems) or the "prolongation" cells (which often 
coincide with the orbit closures), see j3]. The minimal subsystems are of particular 
importance here. Although we can not say, in any reasonable sense, that the study of 
the general system can be reduced to that of its minimal components, the analysis of 
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the minimal systems is nevertheless an important step towards a better understanding 
of the general system. 

This reasoning leads us to the study of the collection of indecomposable systems 
(ergodic systems in the measure category and transitive or minimal systems in the 
topological case) and their factors. The simplest and best understood indecomposable 
dynamical systems are the ergodic translations of a compact monothetic group (a 
cyclic permutation on for a prime number p, the "adding machine" on Y[^=o 
an irrational rotation z \—>- e^™2 on 5*^ = {2; G C : \z\ = 1} etc.). It is not hard to 
show that this class of ergodic actions is characterized as those dynamical systems 
which admit a model {X, X, fi, T) where X is a compact metric space, T : X — >■ 
X a surjective isometry and /i is T-ergodic. We call these systems Kronecker or 
isometric systems. Thus our first question concerning the existence of factors should 
be: given an ergodic dynamical system X which are its Kronecker factors? Recall 
that a measure dynamical system X = (X, X, /i, T) is called weakly mixing if the 
product system (X x X, X (S) X, /x x /i, T x T) is ergodic. The following classical 
theorem is due to von Neumann. The short and elegant proof we give was suggested 
by Y. Katznelson. 

2.1. Theorem. An ergodic system X is weakly mixing iff it admits no nontrivial 
Kronecker factor. 

Proof. Suppose X is weakly mixing and admits an isometric factor. Now a factor 
of a weakly mixing system is also weakly mixing and the only system which is both 
isometric and weakly mixing is the trivial system (an easy exercise). Thus a weakly 
mixing system does not admit a nontrivial Kronecker factor. 

For the other direction, if X is non-weakly mixing then in the product space X x X 
there exists a T-invariant measurable subset W such that < (/i x fi){W) < 1. For 
every x G X let W{x) = {x' G X : {x,x') G W} and let f^ = Iwix), a function in 
L'^{fi). It is easy to check that Urfx = fr-^x so that the map vr : X -^^(a*) defined 
by 7[{x) = fx, X G X is a Borel factor map. Denoting 

7r(X) = F C L^(/i), and ij = 'k^{ij), 

we now have a factor map tt : X ^ (F, z/). Now the function ||7r(x)|| is clearly 
measurable and invariant and by ergodicity it is a constant /x-a.e.; say ||7r(x)|| = 1. 
The dynamical system (F, v) is thus a subsystem of the compact dynamical system 
(-B, Ut)i where B is the unit ball of the Hilbert space i^^(/i) and Ut is the Koopman 
unitary operator induced by T on L'^{^). Now it is well known (see e.g. |SH]) that 
a compact topologically transitive subsystem which carries an invariant probability 
measure must be a Kronecker system and our proof is complete. □ 

Concerning the terminology we used in the proof of Theorem 12.11 B. O. Koopman, 
a student of G. D. Birkhoff and a co-author of both Birkhoff and von Neumann 
introduced the crucial idea of associating with a measure dynamical system X = 
(X, X, /i, T) the unitary operator Ut on the Hilbert space L'^{fi). It is now an easy 
matter to see that Theorem 12.11 can be re-formulated as saying that the system X is 
weakly mixing iff the point spectrum of the Koopman operator Ut comprises the single 
complex number 1 with multiplicity 1. Or, put otherwise, that the one dimensional 
space of constant functions is the eigenspace corresponding to the eigenvalue 1 (this 
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fact alone is equivalent to the ergodicity of the dynamical system) and that the 
restriction of Ut to the orthogonal complement of the space of constant functions has 
a continuous spectrum. 

We now consider a topological analogue of this theorem. Recall that a topo- 
logical system {X, T) is topologically weakly mixing when the product system 
{X X X,T X T) is topologically transitive. It is equicontinuous when the family 
{T" : n G Z} is an equicontinuous family of maps. Again an equivalent condition is 
the existence of a compatible metric with respect to which T is an isometry. And, 
moreover, a minimal system is equicontinuous iff it is a minimal translation on a 
compact monothetic group. We will need the following lemma. 

2.2. Lemma. Let {X,T) be a minimal system and / : X — M a T -invariant function 
with at least one point of continuity (for example this is the case when f is lower or 
upper semi- continuous or more generally when it is the pointwise limit of a sequence 
of continuous functions ), then f is a constant. 

Proof. Let Xq be a continuity point and x an arbitrary point in X. Since {T"a; : 
n G Z} is dense and as the value fiT'^x) does not depend on n it follows that 
f{x) = f{xo). □ 



2.3. Theorem. Let {X,T) be a minimal system then {X,T) is topologically weakly 
mixing iff it has no non-trivial equicontinuous factor. 

Proof. Suppose {X, T) is minimal and topologically weakly mixing and let vr : {X, T) — 
(y, T) be an equicontinuous factor. If {x, x') is a point whose T x T orbit is dense 
in X X X then {y,y') = {7r{x),7r{x')) has a dense orbit inY xY. However, if {Y,T) 
is equicontinuous then Y admits a compatible metric with respect to which T is an 
isometry and the existence of a transitive point in Y x Y implies that y is a trivial 
one point space. 

Conversely, assuming that (X x X, T x T) is not transitive we will construct an 
equicontinuous factor {Z,T) of (X, T). As (X, T) is a minimal system, there exists 
a T- invariant probability measure fi on X with full support. By assumption there 
exists an open T-invariant subset U of X x X, such that cist/ := M C X x X. By 
minimality the projections of M to both X coordinates are onto. For every y G X 
let M{y) = {x G X : (x, ?/) G M}, and let fy = lM{y) be the indicator function of the 
set M{y), considered as an element of L^{X,fi). 

Denote by vr : X L^{X, /u) the map y ^ fy. We will show that vr is a continuous 
homomorphism, where we consider (X, /i) as a dynamical system with the isometric 
action of the group {U^ : n G Z} and Uxfix) = f{Tx). Fix ?/o G X and e > 0. There 
exists an open neighborhood V of the closed set M{yo) with fi{V\M{yo)) < e. Since 
M is closed the set map y M{y),X —>■ 2^ is upper semi-continuous and we can 
find a neighborhood W of ?/o such that M{y) C V for every y G W . Thus for every 
?/ G ly we have \x{M{y) \ M{yo)) < e. In particular, ij,{M{y)) < /i(M(|/o)) + e and it 
follows that the map y t— > fi{M{y)) is upper semi-continuous. A simple computation 
shows that it is T-invariant, hence, by Lemma f2. 21 a constant. 
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With i/Q, e and V, W as above, for every y eW, fi{M{y)\M{yo)) < e and fi{M{y)) = 
fj,{M{yQ)), thus /i(M(?/)AM(?/o)) < 2e, i.e., \\fy — fyo\\i < 2e. This proves the claim 
that TT is continuous. 

Let Z = 7r(X) be the image of X in L^{^). Since vr is continuous, Z is compact. 
It is easy to see that the T-invariance of M imphes that for every n E'L and y E X, 
fT-"y = fy°T'^ so that Z is [/r-invariant and vr : (Y, T) — » [Z, Ut) is a homomorphism. 
Clearly (Z, is minimal and equicontinuous (in fact isometric). □ 

Theorem 12. 31 is due to Keynes and Robertson [HT] who developed an idea of Fursten- 
berg, [22]; and independently to K. Petersen [70] who utilized a previous work of W. 
A. Veech, [7H]. The proof we presented is an elaboration of a work of McMahon 
[nn] due to Blanchard, Host and Maass, [T^]. We take this opportunity to point out 
a curious phenomenon which recurs again and again. Some problems in topological 
dynamics — hke the one we just discussed — whose formulation is purely topological, 
can be solved using the fact that a Z dynamical system always carries an invariant 
probability measure, and then employing a machinery provided by ergodic theory. In 
several cases this approach is the only one presently known for solving the problem. 
In the present case however purely topological proofs exist, e.g. the Petersen- Veech 
proof is one such. 

3. DiSJOINTNESS: MEASURE VS. TOPOLOGICAL 

In the ring of integers Z two integers m and n have no common factor if whenever 
k\m and k\n then k = ±1. They are disjoint if m ■ n is the least common multiple 
of m and n. Of course in Z these two notions coincide. In his seminal paper of 
1967 23 , H. Furstenberg introduced the same notions in the context of dynamical 
systems, both measure-preserving transformations and homeomorphisms of compact 
spaces, and asked whether in these categories as well the two are equivalent. The 
notion of a factor in, say the measure category, is the natural one: the dynamical 
system Y = (y, y, z/, T) is a factor of the dynamical system X = {X, X, /i, T) if there 
exists a measurable map it : X ^ Y with vr(/x) = u that T o 71 = n o T. A common 
factor of two systems X and Y is thus a third system Z which is a factor of both. A 
joining of the two systems X and Y is any system W which admits both as factors 
and is in turn spanned by them. According to Furstenberg's definition the systems X 
and Y are disjoint if the product system X x Y is the only joining they admit. In 
the topological category, a joining of (X, T) and (Y, S) is any subsystem W C X x Y 
of the product system {X x Y,T x S) whose projections on both coordinates are full; 
i.e. nx{W) = X and TTyiW) = Y. (X,T) and {Y, S) are disjoint if X x F is the 
unique joining of these two systems. It is easy to verify that if (X, T) and {Y, S) are 
disjoint then at least one of them is minimal. Also, if both systems are minimal then 
they are disjoint iff the product system (X x Y, T x S") is minimal. 

In 1979, D. Rudolph, using joining techniques, provided the first example of a 
pair of ergodic measure preserving transformations with no common factor which are 
not disjoint [72]. In this work Rudolph laid the foundation of joining theory. He 
introduced the class of dynamical systems having "minimal self- joinings" (MS J), and 
constructed a rank one mixing dynamical system having minimal self-joinings of all 
orders. 
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Given a dynamical system X = {X, X, /i, T) a probability measure A on the product 
of k copies of X denoted Xi, X2, . . . , X^, invariant under the product transformation 
and projecting onto fi in each coordinate is a fc-fold self-joining. It is called an 
off-diagonal if it is a "graph" measure of the form A = gr (/i, T"^, . . . jT"*^), i.e. A 

k 

is the image of /i under the map x t— > (T"-^x,T^^x, . . . ,T"'=x) of X into Yl -^i- The 

i=l 

joining A is a product of off-diagonals if there exists a partition (Ji, . . . , Jm) of 
{1, . . . ,k} such that (i) For each /, the projection of A on Yl is an off-diagonal, (ii) 

The systems Yl -^i^ ^ — ^ — ''^^ independent. An ergodic system X has minimal 

self-joinings of order k if every fc-fold ergodic self-joining of X is a product of 
off-diagonals. 

In [72\ Rudolph shows how any dynamical system with MSJ can be used to con- 
struct a counter example to Furstenberg's question as well as a wealth of other counter 
examples to various questions in ergodic theory. In del Junco, Rahe and Swanson 
were able to show that the classical example of Chacon ^H] has MSJ, answering a 
question of Rudolph whether a weakly but not strongly mixing system with MSJ 
exists. In j2H] Glasner and Weiss provide a topological counterexample, which also 
serves as a natural counterexample in the measure category. The example consists 
of two horocycle flows which have no nontrivial common factor but are nevertheless 
not disjoint. It is based on deep results of Ratner [7T] which provide a complete 
description of the self joinings of a horocycle flow. More recently an even more strik- 
ing example was given in the topological category by E. Lindenstrauss, where two 
minimal dynamical systems with no nontrivial factor share a common almost 1-1 
extension, [63j. 

Beginning with the pioneering works of Furstenberg and Rudolph, the notion of 
joinings was exploited by many authors; Furstenberg 1977 [21], Rudolph 1979 [7^ . 
Veech 1982 [HI], Ratner 1983 71, del Junco and Rudolph 1987 [SH], Host 1991 
[H], King 1992 [5H], Glasner, Host and Rudolph 1992 Thouvenot 1993 |77|, 
Ryzhikov 1994 [73], Kammeyer and Rudolph 1995 (2002) [2S], del Junco, Lemahczyk 
and Mentzen 1995 [5T], and Lemahczyk, Parreau and Thouvenot 2000 to men- 
tion a few. The negative answer to Furstenberg's question and the consequent works 
on joinings and disjointness show that in order to study the relationship between two 
dynamical systems it is necessary to know all the possible joinings of the two systems 
and to understand the nature of these joinings. 

Some of the best known disjointness relations among families of dynamical systems 
are the following: 

• id ± ergodic, 

• distal _L weakly mixing f|23p. 

• rigid _L mild mixing f|27j). 

• zero entropy ± K-systems (^3]), 

in the measure category and 

• F-systems _L minimal (|23j). 

• minimal distal _L weakly mixing, 

• minimal zero entropy _L minimal UPE-systems ([H]), 
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in the topological category. 

4. Mild mixing: measure vs. topological 

4.1. Definition. Let X = {X,X, fi,T) be a measure dynamical system. 

1. The system X is rigid if there exists a sequence nk oo such that 

lim/i(T"Mn A) = fM{A) 

for every measurable subset A of X. We say that X is {nfc}-rigid. 

2. An ergodic system is mildly mixing if it has no non-trivial rigid factor. 

These notions were introduced in |2Z|. The authors show that the mild mixing 
property is equivalent to the following multiplier property. 

4.2. Theorem. An ergodic system X = {X,X, fi,T) is mildly mixing iff for every 
ergodic (finite or infinite) measure preserving system {Y,'^,i>,T), the product system 

{X xY,ijx u,T X T), 

is ergodic. 

Since every Kronecker system is rigid it follows from Theorem 12 . II that mild mixing 
implies weak mixing. Clearly strong mixing implies mild mixing. It is not hard to 
construct rigid weakly mixing systems, so that the class of mildly mixing systems is 
properly contained in the class of weakly mixing systems. Finally there are mildly 
but not strongly mixing systems; e.g. Chacon's system is an example (see Aaronson 
and Weiss P]). 

We also have the following analytic characterization of mild mixing. 

4.3. Proposition. An ergodic system X is mildly mixing iff 

limsup (pf{n) < 1, 

n— >oo 

for every matrix coefficient (pf, where for f G L'^{X,fi), \\f\\ = 1, (t)f{n) := {Ur^f, f). 

Proof. If X — s> Y is a rigid factor, then there exists a sequence — > oo such that 
Ux^i id strongly on z/). For any function / G LKY,^) with ||/|| = 1, 

we have limj^oo = 1- Conversely, if limj^oo = 1 for some rii y oo 

and / G Lq(X, /x), ||/|| = 1, then limj^oo^T"^/ = /• Clearly / can be replaced 
by a bounded function and we let A be the sub-algebra of L°° (X, /x) generated by 
{Ur^f : n G Z}. The algebra A defines a non-trivial factor X — Y such that 
Ux^i id strongly on u). □ 

We say that a collection 3" of nonempty subsets of Z is a family if it is hereditary 
upward and proper (i.e. A G B and A G 3" implies B G 3^, and 3" is neither empty 
nor all of 2^). 

With a family 3^ of nonempty subsets of Z we associate the dual family 

T = {E : EnF ^dsy F eS"}. 

It is easily verified that 3"* is indeed a family. Also, for families, G 3^2 =^ 3^1 D 3^2 ^ 
and 3"** = 3". 
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We say that a subset J of Z has uniform density 1 if for every < A < 1 there 
exists an N such that for every interval / C Z of length > N we have |J H / | > A|/|. 
We denote by 2) the family of subsets of Z of uniform density 1. It is also easy to see 
that D has the finite intersection property. 

Let 5" be a family of nonempty subsets of Z which is closed under finite intersections 
(i.e. 3" is a filter). Following [23] we say that a sequence {xn : n G Z} in a topological 
space X J'-converges to a point x G X if for every neighborhood V of x the set 
{n : Xn & V} is in 5". We denote this by 

5"- limx„ = X. 

We have the following characterization of weak mixing for measure preserving sys- 
tems which explains more clearly its name. 

4.4. Theorem. The dynamical system X = (X, X, /x, T) is weakly mixing iff for every 
A, B E X we have 

•D - lim/i(T-"v4 n 5) = i2{A)i2{B). 

An analogous characterization of measure theoretical mild mixing is obtained by 
considering the families of IP and IP* sets. An IP-set is any subset of Z containing 
a subset of the form IP{ni} = {nj^ + + ■ ■ ■ + Ui^, : ii < 12 < ■ ■ ■ < ifc}, for some 
infinite sequence {nj}^^. We let U denote the family of IP-sets and call the elements 
of the dual family J*, IP* -sets. Again it is not hard to see that the family of /P*-sets 
is closed under finite intersections. For a proof of the next theorem we refer to [22] • 

4.5. Theorem. The dynamical system X = (X, X, /x, T) is mildly mixing iff for every 
A, B E X we have 

T - lim /i(T-M nB)= fi{A)fi{B). 

We now turn to the topological category. Let (X, T) be a topological dynamical 
system. For two non-empty open sets U,V G X and a point a; G X set 

N{U, V) = {neZ: T^'U H V ^ 0}, X+(f/, V) = N{U, V) n Z+ 
and N{x, V) = {n e Z : T^'x e V}. 

Notice that sets of the form N{U, U) are symmetric. 

We say that (X, T) is topologically transitive (or just transitive) if N{U, V) is 
nonempty whenever U,V (Z X are two non-empty open sets. Using Baire's category 
theorem it is easy to see that (for metrizable X) a system (X, T) is topologically 
transitive iff there exists a dense Gs subset Xq C X such that Ot{x) = X for every 

X G Xq. 

We define the family ff'thick of thick sets to be the collection of sets which contain 
arbitrary long intervals. The dual family I^synd = S^twck is the collection of syndetic 
sets — those sets A C Z such that for some positive integer N the intersection of A 
with every interval of length N is nonempty. 

Given a family U we say that a topological dynamical system (X, T) is J'-recurrent 
if N{A, y4) G 5" for every nonempty open set A G X. We say that a dynamical system 
is ff'-transitive if N{A, 5) G 5" for every nonempty open sets A,BgX. The class of 
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J'-transitive systems is denoted by £g^. E.g. in this notation the class of topologically 
mixing systems is Scofinite, where we call a subset A C Z co-finite when Z \ A is 
a finite set. We write simply £ = Sinfinite for the class of recurrent transitive 
dynamical systems. It is not hard to see that when X has no isolated points {X, T) 
is topologically transitive iff it is recurrent transitive. From this we then deduce that 
a weakly mixing system is necessarily recurrent transitive. 

In a dynamical system {X, T) a point x G X is a wandering point if there exists 
an open neighborhood U of x such that the collection {T^U : G Z} is pairwise 
disjoint. 

4.6. Proposition. Let (X,T) he a topologically transitive dynamical system; then the 
following conditions are equivalent: 

1. (X, T) G £ infinite- 

2. The recurrent points are dense in X . 

3. (X,T) has no wandering points. 

4. The dynamical system {Xoo,T), the one point compactification of the integers 
with translation and a fixed point at infinity, is not a factor of (X,T) . 

Proof 1 ^ 4 If TT : X ^ X^o is a factor map then, clearly N{n'^{0), 7r"^(0)) = {0}. 

4 =^ 3 If t/ is a nonempty open wandering subset of X then {T^U : j G Z} U {X \ 
[j{T^U : j G Z}) is a partition of X. It is easy to see that this partition defines a 
factor map it : X X^o- 

3^2 This implication is a consequence of the following: 

4.7. Lemma. // the dynamical system (X,T) has no wandering points then the re- 
current points are dense in X. 

Proof. For every 5 > put 

^5 = {x G X : 3j ^ 0, d{T^x, x) < 5}. 

Clearly is an open set and we claim that it is dense. In fact given x G X and e > 
there exists j 7^ with 

V B,{x) r\ B,{x) ^ 0. 

If y is a point in this intersection then d{T~^y,y) < 2e. Thus for e < 6/2 we have 
y E As and d{x, y) < e. Now by Baire's theorem 

00 

A = Pi Ai/fc 

k=l 

is a dense Gs subset of X and each point in A is recurrent. □ 

2^1 Given U, V nonempty open subsets of X and k G N{U, V) let Uq be the 
nonempty open subset Uq = U n T~''V. Check that N{Uo, Uq) + k C N{U, V). By 
assumption N{Uo,Uo) is infinite and a fortiori so is N{U,V). This completes the 
proof of Proposition 14.61 □ 

A well known characterization of the class WM of topologically weakly mixing 
systems is due to Furstenberg: 



4.8. Theorem. WM = £thick. 
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Following jS] we call the systems in Sgynd topologically ergodic and write TE 
for this class. This is a rich class as we can see from the following claim from 39j- 
Here MIN is the class of minimal systems and E the class of -E-systems; i.e. those 
transitive dynamical systems {X, T) for which there exists a probability invariant 
measure with full support. 

4.9. Theorem. MIN, E C TE. 

Proof. 1. The claim for MIN is immediate by the well known characterization of 
minimal systems: (X, T) is minimal iff N{x, U) is syndetic for every x G X and 
nonempty open U d X. 

2. Given two non-empty open sets f/, V in X, choose /c G Z with T^U fl V 7^ 0. Next 
set Uq = T-^V n U, and observe that k + N{Uo, Uq) C N{U, V). Thus it is enough to 
show that N{U, U) is syndetic for every non-empty open U. We have to show that 
N{U, U) meets every thick subset B dTj. By Poincare's recurrence theorem, N{U, U) 
meets every set of the form A — A = {n — m : n, m d A} with A infinite. It is an easy 
exercise to show that every thick set B contains some D~^{A) = {an — '■ n > m} 
for an infinite sequence A = {a„}. Thus ^ N{U, U) n ±D+{A) C N{U, U) n ±B. 
Since N{U, U) is symmetric, this completes the proof. □ 

We recall (see the previous section) that two dynamical systems (X, T) and (Y, T) 
are disjoint if every closed T x T-invariant subset of X x y whose projections on X 
and Y are full, is necessarily the entire space X x y . It follows easily that when (X, T) 
and {Y,T) are disjoint, at least one of them must be minimal. If both (X, T) and 
{Y, T) are minimal then they are disjoint iff the product system is minimal. We say 
that (X, T) and {Y, T) are weakly disjoint when the product system (X xY,TxT) 
is transitive. This is indeed a very weak sense of disjointness as there are systems 
which are weakly disjoint from themselves. In fact, by definition a dynamical system 
is topologically weakly mixing iff it is weakly disjoint from itself. 

If P is a class of recurrent transitive dynamical systems we let P"^ be the class of 
recurrent transitive dynamical systems which are weakly disjoint from every member 
of P 

P^ = {(X, T) : X X y G £ for every (F, T) G 
We clearly have P C Q ^ P^ D and P^^^ = P^. 

For the discussion of topologically mildly mixing systems it will be convenient to 
deal with families of subsets of Z+ rather than Z. If 3" is such a family then 

£gr = {(X, T) : N+{A, B) e 3" for every nonempty open A, B C X}. 

Let us call a subset of Z+ a S IP-set (symmetric IP-set), if it contains a subset of the 
form 

SIP{ni} = {na-n,3>0: n„, G IP{ni} U {0}}, 

for an IP sequence IP{ni} C Z+. Denote by S the family of SIP sets. It is not hard 
to show that 

S^thick C § C J, 

(see [2S])- Hence 3^syndetic ^ §* 3 J*, hence £synd ^ £§* D £j*, and finally 

Sfwnd ^ Sg* C £^, . 



16 



E. GLASNER AND B. WEISS 



4.10. Definition. A topological dynamical system {X,T) is called topologically 
mildly mixing if it is in £§* and we denote the collection of topologically mildly 
mixing systems by MM = £§•. 



4.11. Theorem. A dynamical system is in £ ijf it is weakly disjoint from every 
topologically mildly mixing system: 

£ = MM^. 

And conversely it is topologically mildly mixing iff it is weakly disjoint from every 
recurrent transitive system: 

MM = £^. 

Proof. 1. Since £§* is nonvacuous (for example every topologically mixing system is 
in £§.), it follows that every system in £§; is in £. 

Conversely, assume that (X,T) is in £ but (X,T) ^ £§,, and we will arrive at a 
contradiction. By assumption there exists (Y,T) G £§* and a nondense nonempty 
open invariant subset W G X x Y . Then nx{W) = O is a nonempty open invariant 
subset of X. By assumption O is dense in X. Choose open nonempty sets Uq G X 
and Vq G Y with Uq x Vq G W. By Proposition 14.61 there exists a recurrent point 
Xq in Uq G O. Then there is a sequence oo such that for the IP-sequence 

{n^} = IP{ni}Zi, /P-hmr""Xo = Xq (see PI). Choose io such that T""Xo G Uq for 
ria G J = IP{ni}i>iQ and set D = SIP (J). Given V a nonempty open subset of Y we 
have: 

D n N{Vo, V) ^ 0. 

Thus for some a, [3 and fo G Vq, 

T""-"''(T"%o,^^o) = (r"°xo,T"'^"''''wo) e (f/o xV)f\W. 
We conclude that 

[xq] xY G c\sW. 

The fact that in an £ system the recurrent points are dense together with the 
observation that {a;o} xY G c\sW for every recurrent point xq G O, imply that W 
is dense in X x F, a contradiction. 

2. From part 1 of the proof we have £ = £§*, hence £"^ = £§;"^ D £§*. 

Suppose (X,T) G £ but (X,T) ^ £§., we will show that (X,T) ^ £^. There 
exist t/, V C X, nonempty open subsets and an IP-set I = IP{nj} for a monotone 
increasing sequence {rii < n2 < ■ ■ ■} with 

N{u, n D = 0, 

where 

D = {ua - : Ua, G I, > n^}. 
If (X,T) is not topologically weakly mixing then X x X ^ 8. hence (X,T) ^ £"^. So 
we can assume that (X,T) is topologically weakly mixing. Now in X x X 

N{U xV,VxU) = N{U, V) n N{V, U) = N{U, V) n -N{U, V), 

is disjoint from D U —D, and replacing X by X x X we can assume that N{U, V) fl 
{D U -D) = 0. In fact, if X G £^ then X x F G £ for every F G £, therefore 
X X (X X F) G £ and we see that also X x X G £^. 
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By going to a subsequence, we can assume that 

k 

lim rik+i — y rii = oo. 

i=l 

in which case the representation of each n G / as n = no, = n^^ + + ■ ■ ■ + nj^. ; a = 
{zi < 22 < ■ ■ ■ < ik} is unique. 

Next let yo G {0, 1}^ be the sequence yo = 1/. Let Y be the orbit closure of yo in 
{0, 1}^ under the shift T, and let [1] = {yeY : y{0) = 1}. Observe that 

Niyo, [1]) = /. 

It is easy to check that 

/P-limT"'^j/o = 2/o. 

Thus the system (Y,T) is topologically transitive with yo a recurrent point; i.e. 
(Y,T) e £. 
We now observe that 

iV([l], [1]) = N{yo, [1]) - N{yo, [1]) =I-I = DU-DU {0}. 

If X X y is topologically transitive then in particular 

N{U x[l],Vx [1]) = N{U, V) n iV([l], [1]) = 

iV(f/, V) n (D U -D U {0}) = infinite set. 

But this contradicts our assumption. Thus X x Y ^ E and (X,T) ^ £"^. This 
completes the proof. □ 

We now have the following: 

4.12. Corollary. Every topologically mildly mixing system is weakly mixing and topo- 
logically ergodic: 

MM C WM n TE. 

Proof. We have £§. C £ = £§,, hence for every (X,T) G £§., X x X G £ i.e. 
(X,T) is topologically weakly mixing. And, as we have already observed the inclusion 
S'syndctic D §*, entails TE = £synd D £§* = MM. □ 

To complete the analogy with the measure theoretical setup we next define a topo- 
logical analogue of rigidity. This is just one of several possible definitions of topological 
rigidity and we refer to jHZI for a treatment of these notions. 

4.13. Definition. A dynamical system (X, T) is called uniformly rigid if there 
exists a sequence Uk y oc such that 

lim sup d{T^''x, x) = 0, 

i.e. limfc^oo 7""* = id in the uniform topology on the group of homeomorphism of 
H{X) of X. We denote by the collection of topologically transitive uniformly rigid 
systems. 

In p7j the existence of minimal weakly mixing but nonetheless uniformly rigid 
dynamical systems is demonstrated. However, we have the following: 
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4.14. Lemma. A system which is both topologically mildly mixing and uniformly rigid 
is trivial. 

Proof. Let (X,T) be both topologically mildly mixing and uniformly rigid. Then 

A = cls{r" : n G Z} C H{X), 

is a Polish monothetic group. 

Let T"* be a sequence converging uniformly to id, the identity element of A. 
For a subsequence we can ensure that {n^} = IPjraj} is an IP-sequence such that 
IP- lim T"" = id in A. If X is nontrivial we can now find an open ball B = Bs{xq) C X 
with TBnB = Put f/ = 5^/2 (xo) and V = TU; then by assumption N{U, V) is 
an SIP*-set and in particular: 

Vao 3a, /? > ao, Ua — np E N{U, V). 

However, since IP- lim T"" = id, we also have eventually, T"'"~"'fU G B; a. contradic- 
tion. □ 



4.15. Corollary. A topologically mildly mixing system has no nontrivial uniformly 
rigid factors. 

We conclude this section with the following result which shows how these topolog- 
ical and measure theoretical notions are related. 

4.16. Theorem. Let {X,T) be a topological dynamical system with the property that 
there exists an invariant probability measure fi with full support such that the associ- 
ated measure preserving dynamical system {X, X, /i, T) is measure theoretically mildly 
mixing then {X, T) is topologically mildly mixing. 

Proof. Let {Y, S) be any system in £; by Theorem 14.111 it suffices to show that (X x 
y, T X 5") is topologically transitive. Suppose W d XxY isa. closed T x S-invariant set 
with int W ^ ^. Let U <Z X,V d V he two nonempty open subsets with U xV d W . 
By transitivity of (F, S) there exits a transitive recurrent point yo eV . By theorems 
of Glimm and Effros |42J, [21J, and Katznelson and Weiss |^ (see also Weiss |82j). 
there exists a (possibly infinite) invariant ergodic measure u otiY with i^iV) > 0. 

Let /i be the probability invariant measure of full support on X with respect to 
which {X, X, fi, T) is measure theoretically mildly mixing. Then by [^Zj the measure 
IJ, XV is ergodic. Since /x x !/{]¥) > fix z/(f/ x F) > we conclude that fi x z/(iy ) = 
which clearly implies W = X x Y . □ 

We note that the definition of topological mild mixing and the results described 
above concerning this notion are new. However independently of our work Huang 
and Ye in a recent work also define a similar notion and give it a comprehensive and 
systematic treatment, ^ij. The first named author would like to thank E. Akin for 
instructive conversations on this subject. 

Regarding the classes WM and TE let us mention the following result from |H3] . 

4.17. Theorem. 

TE = WM^. 



For more on these topics we refer to [22], [En], |1H] and 
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5. Distal systems: topological vs. measure 

As noted above the Kronecker or minimal equicontinuous dynamical systems can 
be considered as the most elementary type of systems. What is then the next stage? 
The clue in the topological case, which chronologically came first, is to be found in 
the notion of distality. A topological system {X, T) is called distal if 

inf ci(T"a;, T"x') > 

for every x x' in X. It is easy to see that this property does not depend on the choice 
of a metric. And, of course, every equicontinuous system is distal. Is the converse 
true? Are these notions one and the same? The dynamical system given on the unit 
disc D = {z & C : \z\ < 1} hj the formula Tz = z exp{27Ti\z\) is a counter example, 
it is distal but not equicontinuous. However it is not minimal. H. Furstenberg in 
1963 noted that skew products over an equicontinuous basis with compact group 
translations as fiber maps are always distal, often minimal, but rarely equicontinuous, 
j22j . A typical example is the homeomorphism of the two torus = M^/Z^ given 
by T{x, y) = {x + a,y + x) where a & M/Z is irrational. Independently and at about 
the same time, it was shown by L. Auslander, L. Green and F. Hahn that minimal 
nilflows are distal but not equicontinuous, [U]. These examples led Furstenberg to his 
path breaking structure theorem, [22] • 

Given a homomorphism vr : {X,T) — > {y,T) let i?^ = {{x,x') : tt{x) = 7t{x')}. We 
say that the homomorphism vr is an isometric extension if there exists a continuous 
function d : Rj^ ^ M. such that for each y the restriction of d to 7i^^{y) x 7i^^{y) 
is a metric and for every x,x' G TT~'^{y) we have d{Tx,Tx') = d{x,x'). 

If is a compact subgroup of Aut {X, T) (the group of homeomorphisms of X 
commuting with T, endowed with the topology of uniform convergence) then the 
map X I— >■ Kx defines a factor map (X, T) {Y, T) with Y = X/ K and i^^r = 
{{x,kx) : X E X, k E K}. Such an extension is called a group extension. It 
turns out, although this is not so easy to see, that when {X,T) is minimal then 
vr : {X, T) — > [Y, T) is an isometric extension iff there exists a commutative diagram: 

(X,T) 




(r,T) 

where {X, T) is minimal and (X, T) (X, T) is a group extension with some compact 
group K C Aut (X, T) and the map p is the quotient map from X onto X defined 
by a closed subgroup H of K. Thus Y = X / K and X = X/H and we can think of 
vr as a homogeneous space extension with fiber K/H. 

We say that a (metrizable) minimal system (X, T) is an / system if there is 
a (countable) ordinal rj and a family of systems {(Xe,xe)}e<^ such that (i) Xq is 
the trivial system, (ii) for every 6 < rj there exists an isometric homomorphism 
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(j)o : Xqj^i — i> Xg, (iii) for a limit ordinal X < f] the system Xx is the inverse limit of 
the systems {Xg}g^x (i-e. Xx = Ve<A(^e, a^e)), and (iv) X^ = X. 

5.1. Theorem (Furstenberg's structure theorem). A minimal system is distal iff it 
is an I-system. 



W. Parry in his 1967 paper jnHl suggested an intrinsic definition of measure distality. 
He defines in this paper a property of measure dynamical systems, called "admitting 
a separating sieve" , which imitates the intrinsic definition of topological distality. 

5.2. Definition. Let X be an ergodic dynamical system. A sequence Ai D A2 D ■ ■ ■ 
of sets in X with > and 0, is called a separating sieve if there 

exists a subset Xq C X with /i(Xo) = 1 such that for every x,x' G Xq the condition 
"for every n G N there exists /c G Z with T^x,T^x' G implies x = x', or in 
symbols: 



We say that the ergodic system X is measure distal if either X is finite or there 
exists a separating sieve. 

Parry showed that every measure dynamical system admitting a separating sieve 
has zero entropy and that any T-invariant measure on a minimal topologically distal 
system gives rise to a measure dynamical system admitting a separating sieve. 

If X = (X, X, /i, T) is an ergodic dynamical system and K C Aut (X) is a com- 
pact subgroup (where Aut (X) is endowed with the weak topology) then the system 
Y = X/i^ is well defined and we say that the extension vr : X — Y is a group ex- 
tension. Using (0) we can define the notion of isometric extension or homogeneous 
extension in the measure category. We will say that an ergodic system admits a 
Purstenberg tower if it is obtained as a (necessarily countable) transfinite tower 
of measure isometric extensions. In 1976 in two outstanding papers [SZj, [HE] R- 
Zimmer developed the theory of distal systems (for a general locally compact acting 
group). He showed that, as in the topologically distal case, systems admitting Parry's 
separating sieve are exactly those systems which admit Purstenberg towers. 

5.3. Theorem. An ergodic dynamical system is measure distal iff it admits a Pursten- 
berg tower. 




In jUl] E. Lindenstrauss shows that every ergodic measure distal Z-system can be 
represented as a minimal topologically distal system. For the exact result see Theorem 
USD below. 
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6. FURSTENBERG-ZlMMER STRUCTURE THEOREM VS. ITS TOPOLOGICAL PI 

VERSION 

Zimmer's theorem for distal systems leads directly to a structure theorem for 
the general ergodic system. Independently, and at about the same time, Fursten- 
berg proved the same theorem, He used it as the main tool for his 

proof of Szemeredi's theorem on arithmetical progressions. Recall that an exten- 
sion IT : (X, X, yLi, T) — i> (y, y,z/, T) is a weakly mixing extension if the relative 
product system X x X is ergodic. (The system X x X is defined by the T x T 

Y Y 

invariant measure 

/i X /i = / jj^yX jj,ydiy{y), 
Jy 

on X X X, where yU = jy ^ydv{y) is the disintegration of /i over v.) 

6.1. Theorem (The Furstenberg-Zimmer structure theorem). Let X he an ergodic 
dynamical system. 

1. There exists a maximal distal factor : X ^ Z with cf) is a weakly mixing 
extension. 

2. This factorization is unique. 



Is there a general structure theorem for minimal topological systems? Here, for 
the first time, we see a strong divergence between the measure and the topological 
theories. The culpability for this divergence is to be found in the notions of proxi- 
mality and proximal extension, which arise naturally in the topological theory but do 
not appear at all in the measure theoretical context. In building towers for minimal 
systems we have to use two building blocks of extremely different nature (isometric 
and proximal) rather than one (isometric) in the measure category. A pair of points 
(x, x') G X X X is called proximal if it is not distal, i.e. if inf^g^ (i(T"a;, T"a;') = 0. 
An extension vr : (X, T) — > (Y, T) is called proximal if every pair in R^^ is proxi- 
mal. The next theorem was developed gradually by several authors (Veech, Glasner- 
Ellis-Shapiro, and McMahon, [7^], [21], jHIl, [HOI)- We need first to introduce some 
definitions. We say that a minimal dynamical system (X, T) is strictly PI (proximal 
isometric) if it admits a tower consisting of proximal and isometric extensions. It is 
called a PI system if there is a strictly PI minimal system (X, T) and a proximal 
extension ^ : X ^ X. An extension vr : X ^ F is a RIC extension (relatively 
incontractible) if for every G N and every y E Y the set of almost periodic points 
in Xy = n~^{y) x 7i~^{y) x ■ ■ ■ x 7i~^{y) {n times) is dense. (A point is called almost 
periodic if its orbit closure is minimal.) It can be shown that a every isometric (and 
more generally, distal) extension is RIC. Also every RIC extension is open. Finally a 
homomorphism tt : X — > F is called topologically weakly mixing if the dynamical 
system {R-„,T x T) is topologically transitive. 

The philosophy in the next theorem is to regard proximal extensions as 'negligible' 
and then the claim is, roughly (i.e. up to proximal extensions), that every minimal 
system is a weakly mixing extension of its maximal PI factor. 
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6.2. Theorem (Structure theorem for minimal systems). Given a metric minimal 
system {X,T), there exists a countable ordinal rj and a canonically defined commuta- 
tive diagram (the canonical PI-Tower) 



ei 




^ — Y^o — Zi ^ — Yi •■■ Yy < Zy^i ^ Yyj^i ■■■ 1^ — Yoo 

Qo Pi 9i Pv+i du+i 

where for each v <rj,TTy is RIC, pu is isometric, 9^, 6^ are proximal extensions andnoo 
is RIC and topologically weakly mixing extension. For a limit ordinal u, X^, Y^, tt^ etc. 
are the inverse limits (or joins) of X^^Y^^^Hi^ etc. for l < v. Thus Xoo is a proximal 
extension of X and a RIC topologically weakly mixing extension of the strictly PI- 
system Yoo- The homomorphism 71^0 is an isomorphism (so that X^o = Yoo) iff X is 
a Pl-system. 



We refer to [^2] for a review on structure theory in topofogical dynamics. 



7. Entropy: measure and topological 

7.1. The classical variational principle. For the definitions and the classical re- 
sults concerning entropy theory we refer to |46j, Section 3.7 for measure theory entropy 
and Section 4.4 for metric and topological entropy. The variational principle asserts 
that for a topological Z-dynamical system (X, T) the topological entropy equals the 
supremum of the measure entropies computed over all the invariant probability mea- 
sures on X. It was already conjectured in the original paper of Adler, Konheim 
and McAndrew j2] where topological entropy was introduced; and then, after many 
stages (mainly by Goodwyn, Bowen and Dinaburg; see for example ^7]) matured 
into a theorem in Goodman's paper |43| . 

7.1. Theorem (The variational principle). Let {X,T) be a topological dynamical 
system, then 

/itop(X,T) = sup{/i^ : /i G Mt(X)} = sup{/i^ : /i G M^'^{X)}. 

This classical theorem has had a tremendous influence on the theory of dynamical 
systems and a vast amount of literature ensued, which we will not try to trace here 
(see |46| Theorem 4.4.4]). Instead we would like to present a more recent development. 

7.2. Entropy pairs and UPE systems. As we have noted in the introduction, the 
theories of measurable dynamics (ergodic theory) and topological dynamics exhibit a 
remarkable parallelism. Usually one translates 'ergodicity' as 'topological transitiv- 
ity', 'weak mixing' as 'topological weak mixing', 'mixing' as 'topological mixing' and 
'measure distal' as 'topologically distal'. One often obtains this way parallel theorems 
in both theories, though the methods of proof may be very different. 

What is then the topological analogue of being a K-system? In jH] and P F. 
Blanchard introduced a notion of 'topological for Z-systems which he called UPE 
(uniformly positive entropy). This is defined as follows: a topological dynamical 
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system (X, T) is called a UPE system if every open cover of X by two non-dense 
open sets U and V has positive topological entropy. A local version of this definition 
led to the concept of an entropy pair. A pair (x, x') E X x X, x ^ x' is an entropy 
pair if for every open cover U = {U, V} of X, with x G int (f/^) and x' G int (V^), 
the topological entropy h{li) is positive. The set of entropy pairs is denoted by 
Ex = E^x,T) and it follows that the system (X, T) is UPE iff Ex = {X x X)\ A. In 
general E* = Ex U A is a T x T-invariant closed symmetric and reflexive relation. Is 
it also transitive? When the answer to this question is affirmative then the quotient 
system X/E*^ is the topological analogue of the Pinsker factor. Unfortunately this 
need not always be true even when (X, T) is a minimal system (see jH] for a counter 
example). 

The following theorem was proved in Glasner and Weiss jlUj . 

7.2. Theorem. // the compact system (X, T) supports an invariant measure fi for 
which the corresponding measure theoretical system (X, X, /i, T) is a K -system, then 
(X,T) IS UPE. 

Applying this theorem together with the Jewett-Krieger theorem it is now possible 
to obtain a great variety of strictly ergodic UPE systems. 

Given a T-invariant probability measure yU on X, a pair {x,x') G X x X, x ^ 
x' is called a /i-entropy pair if for every Borel partition a = {Q, Q'^} of X with 
X G int (Q) and x' G int (Q^) the measure entropy h^{a) is positive. This definition 
was introduced by Blanchard, Host, Maass, Martinez and Rudolph in ^2] as a local 
generalization of Theorem l7.2[ It was shown in J2] that for every invariant probability 
measure /i the set E^ of /i-entropy pairs is contained in Ex- 

7.3. Theorem. Every measure entropy pair is a topological entropy pair. 

As in |1U] the main issue here is to understand the, sometimes intricate, relation 
between the combinatorial entropy hc{U) of a cover U and the measure theoretical 
entropy h^{'y) of a measurable partition 7 subordinate to U. 

7.4. Proposition. Let X = (X, X, fi, T) be a measure dynamical system. Suppose 
U = {U, V} is a measurable cover such that every measurable two-set partition 7 = 
{H,H'^} which (as a cover) is finer than U satisfies h^{'y) > 0; then hc{U) > 0. 

Since for a if-measure /i clearly every pair of distinct points is in E^, Theorem 17.21 
follows from Theorem 17.31 It was shown in that when (X, T) is uniquely ergodic 
the converse of Theorem 17.31 is also true: Ex = E^ for the unique invariant measure 
fi on X. 

7.3. A measure attaining the topological entropy of an open cover. In order 
to gain a better understanding of the relationship between measure entropy pairs and 
topological entropy pairs one direction of a variational principle for open covers (The- 
orem [731 below) was proved in Blanchard, Glasner and Host |^10j. Two applications of 
this principle were given in JU]; (i) the construction, for a general system (X, T), of a 
measure /i G Mr(X) with Ex = E^, and (ii) the proof that under a homomorphism 
TT : (X, /i, T) — > (y, I/, T) every entropy pair in Ey is the image of an entropy pair in 
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We now proceed with the statement and proof of this theorem which is of indepen- 
dent interest. The other direction of this variational principle will be proved in the 
following subsection. 

7.5. Theorem. Let {X, T) be a topological dynamical system, and U an open cover 
of X, then there exists a measure /i e Mt{X) such that hfji_{a) > /itopCU) for all Borel 
partitions a finer than U. 

A crucial element of the proof of the variational principle is a combinatorial lemma 
which we present next. We let : [0, 1] — > IR denote the function 

(t){x) = -tlogt for < i < 1; 0(0) = . 

Let £ = {1, 2, ...,£} be a finite set, called the alphabet; sequences uj — uji . . .Un £ 
for n > 1, are called words of length n on the alphabet £ . Let n and k be 

two integers with 1 < k < n. 

For every word ui of length n and every word 6 of length k on the same alphabet, 
we denote by p{0\uj) the frequency of appearances of 6 in a;, i.e. 

p{0\u) = TVT^^^*^ + ^i<^i+\ ■ ■ ■<^j+fe-i = ^1^2 ■■■Ok] ■ 

Tl fv ~r J- 

For every word u; of length n on the alphabet £, we let 

7.6. Lemma. For every /i > 0, e > 0, every integer k>l and every sufficiently large 
integer n, 

card {cj G £" : Hk{oj) < kh] < exp {n{h + e)) . 
Remark. It is equally true that, if h < log(card£), for sufficiently large n, 
card {cu G £" : Hk{uj) < kh] > exp [n{h - e)) . 
We do not prove this inequality here, since we have no use for it in the sequel. 

Proof. The case k — 1. 

We have 

En ^ 
—— r 

where K is the set of g = (gi, . . . , qi) G such that 

e e 
^^Qi — n and )</i. 

i=l i=l ^ 

By Stirling's formula, there exist two universal constants c and c' such that 



c(-rv^<m! <c'(-rv^ 

e e 
for every m > 0. From this we deduce the existence of a constant C{£) such that for 
every q & K, 



< 



C{£) exp (nj^^l^i-)) ^ ^{1) exp(n/i) . 



qil...qe\ ~[ ^ 
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Now the sum (j2I) contains at most {n + 1)^ terms; so that we have 

card {cj G : Hi{uj) < h} < {n + exp{nh) < exp {n{h + e)) 

for all sufficiently large n, as was to be proved. 
The case k > 1. 

For every word u of length n > 2k on the alphabet £, and for < j < A;, we let rij 
be the integral part of and a;^-'^ the word 

of length Uj on the alphabet B = . 

Let now ^ be a word of length k on the alphabet £; we also consider 9 as an element 
of B. One easily verifies that, for every word uj of length n on the alphabet £, 

fc-i 



\p{e\uj)-\Y.v{o\ 



< 



k 



j=0 



n-2k + l 



The function being uniformly continuous, we see that for sufficiently large n, and 
for every word lj of length n on £, 



j=0 



e 

< - 
2 



and by convexity of 



^ k-l fc-1 

Thus, if Hk{uj) < kh, there exists a j such that Hi(uj^^^) < f + A^/i- 

Now, given j and a word u of length on the alphabet B, there exist < £2fc-2 

words to of length n on H such that to^^^ = u. Thus for sufficiently large n, by the 
ffist part of the proof, 

fc-1 

card {a; e £" : Hk{uj) < kh} < ^ card {u G 5"^ : i^i(M) < | + kh] 

j=0 

fe-i 

exp {nj{e + kh)) 

3=0 

e 



< 



n2k-2 



<f''~^kexp {n{- + h)) < exp {n{h + e)) 



□ 



Let (X, T) be a compact dynamical system. As usual we denote by Mt{X) the 
set of T-invariant probability measures on X, and by M^^{X) the subset of ergodic 
measures. 

We say that a partition a is finer than a cover IX when every atom of a is contained 
in an element of 11. If a = {Ai, . . . , Ai} is a partition of X, x G X and X G N, we 
write uj{a,N, x) for the word of length N on the alphabet £, = {!,...,£} defined by 

uj{a,N,x)n = i if T"~^x G A^, l<n<X. 



26 E. GLASNER AND B. WEISS 

7.7. Lemma. Let U be a cover of X , h = htopiU), K > 1 an integer, and {a/ : 1 < 
I < K} a finite sequence of partitions of X , all finer than U. For every e > and 
sufficiently large N, there exists an x E X such that 

Hk{oj{ai, N, x)) > k{h — e) for every k, I with 1 < k, I < K. 

Proof. One can assume that all the partitions ai have the same number of elements 
£ and we let £ = {1, ... , £}. For 1 < A; < /sT and > iT, denote 

Vt{N, k) = {uje2.^ : Hk{uj) < k{h - e)} . 

By Lemma (7.31 for sufficiently large 

card {n{N, k)) < exp{N{h - e/2)) for all k < K. 

Let us choose such an which moreover satisfies < exp{Ne/2). For 1 < k,l < K, 
let 

Z{k, l) = {xeX : uj{ai, N, x) G Q{N, k)} . 

The set Z{k, I) is the union of card (fi(A^, k)) elements of {ai)^'^. Now this parti- 
tion is finer than the cover 1X^~^, hence Z{k, I) is covered by 

card {n{N, k)) < exp{N{h - e/2)) 

elements ofU^"^. Finally, 

U ^(^'0 

l<k,l<K 

is covered by K'^ exp(N{h — e/2)) < exp(Nh) elements of Uq^^. As every subcover 
of IX^"^ has at least exp{Nh) elements, 

U Zik,l)^X. 

l<k,l<K 

This completes the proof of the lemma. □ 

Proof of theorem ]?.^ Let U = {f/i,...,f/^} be an open cover of X. It is clearly 
sufficient to consider Borel partitions a of X of the form 

(3) a = {Ai, . . . , Ai} with Ai C Ui for every i. 
Step 1: Assume first that X is 0-dimensional. 

The family of partitions finer than It, consisting of clopen sets and satisfying (j^I) is 
countable; let {ai : Z > 1} be an enumeration of this family. According to the previous 
lemma, there exists a sequence of integers tending to +cx3 and a sequence xk of 
elements of X such that: 

(4) Hk{uj{ai,NK,XK)) > k{h - -^) for every \<k, I < K. 
Write 

^ Nk-I 

Replacing the sequence fix by a subsequence (this means replacing the sequence Nk 
by a subsequence, and the sequence xk by the corresponding subsequence preserving 
the property (jH)), one can assume that the sequence of measures fix converges weak* 
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to a probability measure /i. This measure /i is clearly T-invariant. Fix k,l >1, and 
let F be an atom of the partition (ai)Q~^, with name 9 G {1, . . . For every K 

one has 

2k 

\fiK{F)-p{e\uj{ai,NK,XK)) \ < 

Now as F is clop en, 

= lim hk^F) = lim p{0\uj{ai^ Nk^xk)) hence 

K—*oo K—*oo 

= lim <j){p{e\uj{ai,NK,XK))) 

K — ^oo 

and, summing over 6^ G {1, . . . , f}'^, one gets 

K — ^oo 

Finally, by sending k to infinity one obtains h^{ai) > h. 

Now, as X is 0-dimensional, the family of partitions {a/} is dense in the collection 
of Borel partitions of X satisfying Q, with respect to the distance associated with 
Thus, h^{a) > h for every partition of this kind. 

Step 2: The general case. 

Let us recall a well known fact: there exists a topological system (Y,T), where Y 
is 0-dimensional, and a continuous surjective map tt : Y ^ X with vr o T = T o tt. 

(Proof : as X is a compact metric space, it is easy to construct a Cantor set K 
and a continuous surjective f : K ^ X. Put 

Y = {yeK^: f{yn+i) = Tf{y^) for every n G Z} 

and let vr : y ^ X be defined by n{y) = f{yo)- 

F is a closed subset of K'^ — where the latter is equipped with the product topology 
— and is invariant under the shift T on K^. It is easy to check that vr satisfies the 
required conditions.) 

Let V = 7i~^(U) = {7!'^^{Ui), . . . , 7!'^^{Ud)} be the preimage of U under n ; one has 
^topC\^) = ^topCU-) = h. By the above remark, there exists u G M{Y,T) such that 
hv{Q) > h for every Borel partition Q of y finer than V. Let ^ = v o tt^^ the measure 
which is the image of v under vr. One has /i G Mt{X) and, for every Borel partition 
a of X finer than U, 7r~^(a) is a Borel partition of Y which is finer than V with 

This completes the proof of the theorem. □ 

7.8. Corollary. Let (X, T) be a topological system, li an open cover of X and a a 
Borel partition finer than U, then, there exists a T-invariant ergodic measure fj, on X 
such that h^{a) > htopiXi). 

Proof. By Theorem 17.51 there exists /i G Mt{X) with h^{a) > htopill); let /i = 
fJ'uj dm{uj) be its ergodic decomposition. The corollary follows from the formula 

j hf,^{a) dm{uj) = h^{a). 

□ 
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7.4. The variational principle for open covers. Given an open cover U of the 
dynamical system [X, T) , the results of the previous subsection imply the inequality 

sup inf hi^{a) > htop{U). 

We will now present a new result which will provide the fact that 

sup inf h^{a) = htopiU) 

thus completing the proof of a variational principle for U. 
We first need a universal version of the Rohlin lemma. 

7.9. Proposition. Let {X,T) be a (Polish) dynamical system and assume that there 

exists on X a T -invariant aperiodic probability measure. Given a positive integer 
n and a real number 5 > there exists a Borel subset B G X such that the sets 
B, TB, . . . , T''^^B are pairwise disjoint and for every aperiodic T -invariant probability 
measure ix e Mt{X) we have Ai(IJj=o ^"'-^) > 1 — f^- 

Proof. Fix N (it should be larger than n/5 for the required height n and error 5). The 
set of points that are periodic with period < is closed. Any point in the complement 
(which by our assumption is nonempty) has, by continuity, a neighborhood U with 

N disjoint forward iterates. There is a countable subcover of such sets since 

the space is Polish. Take Ai = f/i as a base for a Kakutani sky-scraper 

{T^Al:j = 0,...,k-l; k=l,2,...}, 

A'l^ixeAi :rAi(x) = 

where TA^ix) is the first integer j >1 with T^x e Ai. Next set 

[(fe-n-l)/n] 

Bi = \J U T^M^, 

k>l j=0 

SO that the sets Bi, TB^, . . . , T"~^i?i are pairwise disjoint. 

Remove the full forward T orbit of C/i from the space and repeat to find B2 using as 
a base for the next Kakutani sky-scraper A2 defined as intersected with the part of 
X not removed earlier. Proceed by induction to define the sequence Bi, i = 1,2, . . . 
and set B = IJi^i By Poincarc recurrence for any aperiodic invariant measure 
we exhaust the whole space except for n iterates of the union A of the bases of the 
Kakutani sky-scrapers. By construction A — |Jm=i ^ disjoint iterates so that 

H{A) < l/N for every /i e Mt{X). Thus B,TB, . . .,T'^-^B fiU aU but n/N < 5 oi 
the space uniformly over the aperiodic measures e Mt{X). □ 

Let (X, T) be a dynamical system and IC = {Ui, U2, ■ ■ ■ Ue} a finite open cover. We 
denote by A the collection of all finite Borel partitions a which refine U, i.e. for every 
A e a there is some U eU with A C U. We set 

h{U) — sup inf hij_{a) and h{11) = inf sup h^{a). 

7.10. Proposition. Let {X,T) be a dynamical system, 11 = {Ui,U2, ■ ■ - Ui} a finite 
open cover, then 
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1. h{U) < h{U), 

2. h{U) < /itop(lX). 

Proof. 1. Given z/ G Mt{X) and a ^ A we obviously have h,^{a) < sup^gjv/j,(x) h^{a). 
Thus 

inf /Ij,(q;) < inf sup h^{a) = h(U), 

and therefore also h{Vi) < h{Vi). 

2. Choose for e > an integer large enough so that there is a sub cover D C 
1X^-1 = V^^-i T~m of cardinality 2^('^'°p(")+^). Apply Proposition O to find a set 5 
such that the sets B, TB, . . . , T^-^B are pairwise disjoint and for every T-invariant 
Borel probability measure /x G Mt(A:) we have fi{[jjSo T^B) > 1 - 6. Consider 
T)b = {D n B : D E T)}, the restriction of the cover D to B, and find a partition /3 
of B which refines 2)^. Thus each element P E (3 has the form 

P = P^o,^u...^N-^ C (^f]T-'U,^ HB, 

where C\f=o 'P^'^^ij represents a typical element of T). Next use the partition P of B 
to define a partition a = {Ai : i = 1, ...,£} of Ujl^^ ^'^-^ ^y assigning to the set Ai 
all sets of the form Pio,ii,...,ij,...,iff_i where ij = i (j can be any number in [0, A^ — 1]). 
On the remainder of the space a can be taken to be any partition refining 11. 
Now if A^ is large and 6 small enough then 

(5) h^{a) < /itop(lX) + 2e. 

Here is a sketch of how one establishes this inequality. For n >> N we will estimate 
iJ^(aQ~^) by counting how many {n, a)-names are needed to cover most of the space. 
We take 5 > so that ^/S « e. Denote E = BUTBU ■■■U T^-^B (so that 
fi{E) > 1-5). Define 

1 " 

i=0 

and observe that < / < 1 and 

/ f{x) dfi{x) > 1 — 5, 
Jx 

since T is measure preserving. Therefore J {1 — f) < 6 and (Markov's inequality) 

: (1 - /) > v^} < y"(l - /) < ^r6. 

It follows that for points a;inG = {/>l — v^}, we have the property that G E 
for most i in [0,n]. 

Partition G according to the values of i for which T*x G B. This partition has at 
most 




sets, a number which is exponentially small in n (if A^ is sufficiently large). 
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For a fixed choice of these values the times when we are not in E take only n\/S 
values and there we have < choices. 

Finally when T*x G -B we have at most 2*^^'''**°p*^'^'*+'^''-' names so that the total 
contribution is < 2(^(''*°p(^)+'»t . 

Collecting these estimations we find that 

H{a^-')<n{h,^^{U)+2e), 
whence (jS)). This completes the proof of the proposition. □ 

We finally obtain: 

7.11. Theorem (The variational principle for open covers). Let {X, T) be a dynamical 
system, IL = {Ui, U2, ■ ■ ■ Uk} a finite open cover and denote by A the collection of all 
finite Borel partitions a which refine IL, then 

1. for every fi G Mt{X), mfa(. a h^{a) < /itop(^), and 

2. there exists an ergodic measure /iq G Mt{X) with h^^^{a) > htopiU) for every 
Borel partition a & A. 

3. 

h{U) = h{U) = /itop(lX). 

Proof. 1. This assertion can be formulated by the inequality h{U) < /itop(^) and it 
follows by combining the two parts of Lemma 17.101 

2. This is the content of Theorem 17.51 

3. Combine assertions 1 and 2. □ 



7.5. Further results connecting topological and measure entropy. Given a 
topological dynamical system (X, T) and a measure /i G Mt{X), let vr : {X, X, /i, T) — »• 
{Z, Z,ri,T) be the measure-theoretical Pinsker factor of (X, X, /i, T), and let n = 
J^fizdri{z) be the disintegration of fi over {Z,r]). Set 

= / (/Wz X l^z) dri{z), 

the relatively independent joining of fi with itself over rj. Finally let = supp (A) be 
the topological support of A in X x X. Although the Pinsker factor is, in general, only 
defined measure theoretically, the measure A is a well defined element of Mrpxri^ x 
X). It was shown in Glasner |3]jj that = A^ \ A. 

7.12. Theorem. Let (X, T) be a topological dynamical system and let fi G Mt{X). 

1. = \ A and = E^U {{x,x) : x G supp (/u)}. 

2. dsEf, C A^. 

3. If is ergodic with positive entropy then chE^ = A^. 

One consequence of this characterization of the set of yU-entropy pairs is a descrip- 
tion of the set of entropy pairs of a product system. Recall that an ii^-system is a 
system for which there exists a probability invariant measure with full support. 

7.13. Corollary. Let (Xi,T) and (X2,T) be two topological E- systems then: 
1. Ex 1XX2 — {Exi X EX2) U (-Exi X Axa) U (Axi x Exi)- 
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2. The product of two UPE systems is UPE. 

Another consequence is: 

7.14. Corollary. Let {X,T) be a topological dynamical system, P the proximal rela- 
tion on X . Then: 

1. For every T-invariant ergodic measure fi of positive entropy the set P (1 is 
residual in the Gs set E^ of fi entropy pairs. 

2. When Ex 7^ the set PdEx is residual in the Gs set Ex of topological entropy 
pairs. 

Given a dynamical system {X, T) , a pair {x, x') e X x X is called a Li— Yorke pair 
if it is a proximal pair but not an asymptotic pair. A set S* C X is called scrambled 
if any pair of distinct points {x, C 5* is a Li- Yorke pair. A dynamical system 
{X, T) is called chaotic in the sense of Li and Yorke if there is an uncountable 
scrambled set. In ^T] Theorem 17. 121 is applied to solve the question whether positive 
topological entropy implies Li- Yorke chaos as follows. 

7.15. Theorem. Let {X,T) be a topological dynamical system. 

1. // (Y, T) admits a T-invariant ergodic measure fx with respect to which the 
measure preserving system {X, X, /i, T) is not measure distal then {X, T) is 
Li-Yorke chaotic. 

2. If {X,T) has positive topological entropy then it is Li-Yorke chaotic. 

In ^3] Blanchard, Host and Ruette show that in positive entropy systems there are 
also many asymptotic pairs. 

7.16. Theorem. Let {X,T) be a topological dynamical system with positive topological 
entropy. Then 

1. The set of points x & X for which there is some x' ^ x with {x,x') an as- 
ymptotic pair, has measure 1 for every invariant probability measure on X with 
positive entropy. 

2. There exists a probability measure v on X x X such that v a.e. pair (x, x') is 
Li-Yorke and positively asymptotic; or more precisely for some 5 > 

lim (i(T''x, T"x') = 0, and 
lim inf diT'^'x, T'^'x') = 0, lim sup diT-^'x, T-'^x') > 6. 

7.6. Topological determinism and zero entropy. Following [SI] call a dynamical 
system {X, T) deterministic if every T-factor is also a T~^-factor. In other words 
every closed equivalence relation R G X x X which has the property TR C R also 
satisfies T~^R C R. It is not hard to see that an equivalent condition is as follows. 
For every continuous real valued function / G C{X) the function f oT~^ is contained 
in the smallest closed subalgebra A C C{X) which contains the constant function 1 
and the collection {/oT" : n > 0}. The folklore question whether the latter condition 
implies zero entropy was open for awhile. Here we note that the affirmative answer 
is a direct consequence of Theorem 17. 161 fsee also |54j). 
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7.17. Proposition. Let {X,T) be a topological dynamical system such that there 
exists a 6 > and a pair {x,x') & X x X as in Theorem \7.1(^ 2. Then {X,T) is not 
deterministic. 

Proof. Set 

R = {(T"x, TV) : n > 0} U {(TV, T"x) : n > 0} U A. 

Clearly i? is a closed equivalence relation which is T-invariant but not ^-invariant. 

□ 

7.18. Corollary. A topologically deterministic dynamical system has zero entropy. 

Proof. Let (X, T) be a topological dynamical system with positive topological en- 
tropy; by Theorem 17.161 2. and Proposition 17. 1 7l it is not deterministic. □ 
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Part 2. Meeting grounds 



8. Unique ergodicity 



The topological system {X,T) is called uniquely ergodic if Mt{X) consists of a 
single element /i. If in addition /i is a full measure (i.e. supp fi = X) then the system is 
called strictly ergodic (see |l6t Section 4.3]). Since the ergodic measures are char- 
acterized as the extreme points of the Choquet simplex Mt{X), it follows immediately 
that a uniquely ergodic measure is ergodic. For a while it was believed that strict 
ergodicity — which is known to imply some strong topological consequences (like in 
the case of Z-systems, the fact that every point of X is a generic point and moreover 
that the convergence of the ergodic sums A„(/) to the integral / f dfi, f G C{X) is 
uniform) — entails some severe restrictions on the measure-theoretical behavior of 
the system. For example, it was believed that unique ergodicity implies zero entropy. 
Then, at first some examples were produced to show that this need not be the case. 
Furstenberg in [221 ^^"^ Hahn and Katznelson in jH] gave examples of uniquely er- 
godic systems with positive entropy. Later in 1970 R. I. Jewett surprised everyone 
with his outstanding result: every weakly mixing measure preserving Z-system has a 
strictly ergodic model, [20] • This was strengthened by Krieger ^59j who showed that 
even the weak mixing assumption is redundant and that the result holds for every 
ergodic Z-system. 

We recall the following well known characterizations of unique ergodicity (see jHSl 
Theorem 4.9]). 

8.1. Proposition. Let {X,T) be a topological system. The following conditions are 
equivalent. 

1. {X,T) is uniquely ergodic. 

2. C{X) = R + B, where B = {g-goT:ge C{X)}. 

3. For every continuous function f G C{X) the sequence of functions 



converges uniformly to a constant function. 

4. For every continuous function f G C{X) the sequence of functions A„(/) con- 
verges pointwise to a constant function. 

5. For every function f & A, for a collection A C C{X) which linearly spans a 
uniformly dense subspace of C{X), the sequence of functions A„(/) converges 
pointwise to a constant function. 

Given an ergodic dynamical system X = (X, X, /i, T) we say that the system X = 
(X, X, fi, T) is a topological model (or just a model) for X if (X, T) is a topological 
system, fi G Mt{X) and the systems X and X are measure theoretically isomorphic. 
Similarly we say that : X — »• Y is a topological model for vr : X — > Y when tt is 
a topological factor map and there exist measure theoretical isomorphisms and i(j 
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such that the diagram 

X -X 



Y— -Y 

is commutative. 

9. The relative Jewett-Krieger theorem 

In this subsection we will prove the following generalization of the Jewett-Krieger 
theorem (see ^Hl Theorem 4.3.10]). 

9.1. Theorem. // tt : X = (X, X, /i,T) — Y = (y,y,z/, T) is a factor map with 
X ergodic and Y is a uniquely ergodic model for Y then there is a uniquely ergodic 
model X for X and a factor map tt : % ^ Y which is a model for tt : X ^ Y. 

In particular, taking Y to be the trivial one point system we get: 

9.2. Theorem. Every ergodic system has a uniquely ergodic model. 

Several proofs have been given of this theorem, e.g. see ^\ and jZj. We will sketch 
a proof which will serve the relative case as well. 

Proof of theorem \9.1\ A key notion for this proof is that of a uniform partition whose 
importance in this context was emphasized by G. Hansel and J. -P. Raoult, 02] • 

9.3. Definition. A set S G X is uniform if 

N-l 



lim ess-supj 



^ iB{rx)-ii{B) =0. 



A partition T is uniform if, for all A^, every set in V-at T'^y is uniform. 

The connection between uniform sets, partitions and unique ergodicity lies in 
Proposition 18.11 It follows easily from that proposition that if T is a uniform parti- 
tion, say into the sets {Pi, P2, ■ ■ ■ , Pa}, and we denote by 7 also the mapping that 
assigns to x G X, the index 1 < i < a such that x E Pi, then we can map X to 
{1, 2,...,a}^ = by: 

tt{x) = (..., ?{T-^x), ?{x), ?{Tx), . . . , T(T"x), . . .). 

Pushing forward the measure fi by tt, gives vroyu and the closed support of this measure 
will be a closed shift invariant subset, say E C A^. Now the indicator functions of 
finite cylinder sets span the continuous functions on E, and the fact that J* is a 
uniform partition and Proposition 18 . II combine to establish that {E, shift) is uniquely 
ergodic. This will not be a model for (X, X, /i, T) unless V^^oo T""^"? = X modulo 
null sets, but in any case this does give a model for a nontrivial factor of X. 

Our strategy for proving Theorem l9.2l is to first construct a single nontrivial uniform 
partition. Then this partition will be refined more and more via uniform partitions un- 
til we generate the entire a-algebra X. Along the way we will be showing how one can 
prove a relative version of the basic Jewett-Krieger theorem. Our main tool is the use 
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of Rohlin towers. These are sets B such that for some N. B. TB, .... T B are 
disjoint while Uo^~^ most of the space. Actually we need Kakutani-Rohlin 

towers, which are like Rohlin towers but fill up the whole space. If the transformation 
does not have rational numbers in its point spectrum this is not possible with a single 
height, but two heights that are relatively prime, like N and + 1 are certainly 
possible. Here is one way of doing this. The ergodicity of (X, X, fi, T) with n non 
atomic easily yields, for any n, the existence of a positive measure set B, such that 

T BnB = % , i = l, 2,...,n. 

With given, choose n > 10 • A^^ and find B that satisfies the above. It follows that 
the return time 

reix) = mi{i >0:T'xeB} 
is greater than 10 • N"^ on B. Let 

Bi = {x : rB{x) = i}. 

Since i is large (if B^ is nonempty) one can write £ as a positive combination of N 

and N X 1, say 

£ = Nue + {N + l)vi. 

Now divide the column of sets {T'Bf : < i < £} into w^-blocks of size N and V(,- 
blocks of size 1 and mark the first layer of each of these blocks as belonging to C. 
Taking the union of these marked levels {T'^B^ for suitably chosen i) over the various 
columns gives us a set C such that rc takes only two values - either A'" or A'" + 1 as 
required. 

It will be important for us to have at our disposal K-R towers like this such that 
the columns of say the second K-R tower are composed of entire subcolumns of the 
earlier one. More precisely we want the base C2 to be a subset of Ci - the base of 
the first tower. Although we are not sure that this can be done with just two column 
heights we can guarantee a bound on the number of columns that depends only on 
the maximum height of the first tower. Let us define formally: 

9.4. Definition. A set C will be called the base of a bounded K-R tower if for some 
A^, [Jq ^^ T^C = X up to a /i-mill set. The least A^ that satisfies this will be called 
the height of C, and partitioning C into sets of constancy of rc and viewing the 
whole space X as a tower over C will be called the K-R tower with columns the sets 
{T'C^ : 0<i<i}hrCf = {xeC : rdx) = i}. 

Our basic lemma for nesting these K-R towers is: 

9.5. Lemma. Given a bounded K-R tower with base C and height N , for any n 
sufficiently large there is a hounded K-R tower with base D contained in C whose 
column heights are all at least n and at most n -\- AN. 

Proof We take an auxihary set B such that 5n5 = for all < i < 10{n + 2Ny 
and look at the unbounded (in general) K-R tower over B. Using the ergodicity it 
is easy to arrange that B C C. Now let us look at a single column over S^, with 

m > 10 {n-\-2Ny. We try to put down blocks of size n + 2N and n + 2A^ + l, to fill up 
the tower. This can certainly be done but we want our levels to belong to C. We can 
refine the column over Bm into a finite number of columns so that each level is either 
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entirely within C or in X\C. This is done by partitioning the base C according to 
the finite partition: 

m— 1 

fl T-^C, X\C}. 

i=0 

Then we move the edge of each block to the nearest level that belongs to C. The fact 
that the height of C is means that we do not have to move any level more than 
— 1 steps, and so at most we lose 2N — 2 or gain that much thus our blocks, with 
bases now all in C, have size in the interval [n, n + 4A^] as required. □ 

It is clear that this procedure can be iterated to give an infinite sequence of nested 
K-R towers with a good control on the variation in the heights of the columns. These 
can be used to construct uniform partitions in a pretty straightforward way, but we 
need one more lemma which strengthens slightly the ergodic theorem. We will want 
to know that when we look at a bounded K-R tower with base C and with minimum 
column height sufficiently large that for most of the fibers of the towers (that is for 
X G C, {T^x : < z < rc{x)}) the ergodic averages of some finite set of functions 
are close to the integrals of the functions. It would seem that there is a problem 
because the base of the tower is a set of very small measure (less than 1/min column 
height) and it may be that the ergodic theorem is not valid there. However, a simple 
averaging argument using an intermediate size gets around this problem. Here is the 
result which we formulate for simplicity for a single function /: 

9.6. Lemma. Let f be a bounded function and (X, X, /i, T) ergodic. Given e > 0, 
there is an no, such that if a bounded K-R tower with base C has minimum column 
height at least uq, then those fibers over x E C : {T*x : < i < rc(x)} that satisfy 

rc{x)- 



< e 



rc{x) 

fill up at least 1 — e of the space. 

Proof. Assume without loss of generality that |/| < 1. For a 5 to be specified later 
find an N such that the set of y E X which satisfy 

1 

(6) 



fdfi 



< 6 



has measure at least 1 — 6. Let us denote the set of y that satisfy (jHl) by E. Suppose 
now that no is large enough so that N/uo is negligible - say at most S. Consider a 
bounded K-R tower with base C and with minimum column height greater than no. 
For each fiber of this tower, let us ask what is the fraction of its points that lie in 
E. Those fibers with at least a a/5 fraction of its points not in E cannot fill up more 
than a a/5 fraction of the space, because /i(i?) > 1 — S. 

Fibers with more than 1 — of its points lying in E can be divided into disjoint 
blocks of size N that cover all the points that lie in E. This is done by starting at 
X E C, and moving up the fiber, marking the first point in E, skipping N steps and 
continuing to the next point in E until we exhaust the fiber. On each of these N- 
blocks the average of / is within 5 of its integral, and since |/| < 1 if ^/6 < e/10 this 
will guarantee that the average of / over the whole fiber is within e of its Integra. □ 
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We are now prepared to construct uniform partitions. Start with some fixed non- 
trivial partition ^Pq- By Lemma f9 .61 for any tall enough bounded K-R tower at least 
9/10 of the columns will have the 1-block distribution of each To-name within ^ of 
the actual distribution. We build a bounded K-R tower with base Ci(l) and heights 
A^^i, A^^i + 1 with A^^i large enough for this to be valid. It is clear that we can modify CPq 
to Ti on the bad fibers so that now all fibers have a distribution of 1-blocks within ^ 
of a fixed distribution. We call this new partition Ti. Our further changes in Ti will 
not change the A^^i, A^^i + 1 blocks that we see on fibers of a tower over our ultimate 
Ci. Therefore, we will get a uniformity on all blocks of size lOOA'^i. The 100 is to get 
rid of the edge effects since we only know the distribution across fibers over points in 

Next we apply Lemma 19.61 to the 2-blocks in Ti with 1/100. We choose N2 so 
large that N1/N2 is negligible and so that any uniform K-R tower with height at least 
N2 has for at least 99/100 of its fibers a distribution of 2-blocks within 1/100 of the 
global Ti distribution. Apply Lemma 19.51 to find a uniform K-R tower with base 
6*2(2) C Ci(l) such that its column heights are between N2 and N2 + ANi. For the 
fibers with good distribution we make no change. For the others, we copy on most 
of the fiber (except for the top 10 ■ Nf levels) the corresponding Ji-name from one of 
the good columns. In this copying we also copy the Ci(l)-name so that we preserve 
the blocks. The final 10 ■ Nf spaces are filled in with A^^i, A^^i + 1 blocks. This gives us 
a new base for the first tower that we call Ci(2), and a new partition J'2. The features 
of ^2 are that all its fibers over Ci(2) have good (up to 1/10) 1-block distribution, 
and all its fibers over 6*2(2) have good (up to 1/100) 2-block distributions. These will 
not change in the subsequent steps of the construction. 

Note too that the change from Ci(l), to Ci(2), could have been made arbitrarily 
small by choosing N2 sufficiently large. 

There is one problem in trying to carry out the next step and that is, the filling 
in of the top relatively small portion of the bad fibers after copying most of a good 
fiber. We cannot copy an exact good fiber because it is conceivable that no fiber with 
the precise height of the bad fiber is good. The filling in is possible if the column 
heights of the previous level are relatively prime. This was the case in step 2, because 
in step 1 we began with a K-R tower heights Ni,Ni + 1. However, Lemma (9.51 does 
not guarantee relatively prime heights. This is automatically the case if there is no 
rational spectrum. If there are only a finite number of rational points in the spectrum 
then we could have made our original columns with heights LA^i, L{Ni + 1) with L 
being the highest power so that is not ergodic and then worked with multiples 
of L all the time. If the rational spectrum is infinite then we get an infinite group 
rotation factor and this gives us the required uniform partition without any further 
work. 

With this point understood it is now clear how one continues to build a sequence of 
partitions y„ that converge to J* and Ci{k) Ci such that the J-names of all fibers 
over points in Ci have a good (up to 1/10*) distribution of z-blocks. This gives the 
uniformity of the partition T as required and establishes 

9.7. Proposition. Given any ^.n-d any e > there is a uniform partition J* such 
that (i(yo) J') < e in the ii-metric on partitions. 
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As we have already remarked the uniform partition that we have constructed gives 
us a uniquely ergodic model for the factor system generated by this partition. We 
need now a relativized version of the construction we have just carried out. We 
formulate this as follows: 

9.8. Proposition. Given a uniform partition 7 and an arbitrary partition Qq that 
refines 7, for any e > there is a uniform partition Q that also refines J" and satisfies 

||Qo-Q||i <e. 

Even though we write things for finite alphabets, everything makes good sense 
for countable partitions as well and the arguments need no adjusting. However, the 
metric used to compare partitions becomes important since not all metrics on ii are 
equivalent. We use always: 

l|Q-Q||i = E / IIq.-Iq.M/^ 

where the partitions Q and Q are ordered partitions into sets {Qj}, {Qj} respectively. 
We also assume that the cr-algebra generated by the partition 7 is nonatomic - 
otherwise there is no real difference between what we did before and what has to be 
done here. 

We will try to follow the same proof as before. The problem is that when we 
redefine Qq to Q we are not allowed to change the [P-part of the name of points. That 
greatly restricts us in the kind of names we are allowed to copy on columns of K-R 
towers and it is not clear how to proceed. The way to overcome the difficulty is to 
build the K-R towers inside the uniform algebra generated by J". This being done 
we look, for example, at our first tower and the first change we wish to make in Qq. 
We divide the fibers into a finite number of columns according to the height and 
according to the IP-name. 

Next each of these is divided into subcolumns, called Qo-columns, according to the 
Qo-names of points. If a !P-column has some good (i.e. good 1-block distribution of 
Qg-names) Qo-subcolumn it can be copied onto all the ones that are not good. Next 
notice that a T-column that contains not even one good Qo-name is a set defined in 
the uniform algebra. Therefore if these sets have small measure then for some large 
enough N, uniformly over the whole space, we will not encounter these bad columns 
too many times. 

In brief the solution is to change the nature of the uniformity. We do not make 
all of the columns of the K-R tower good - but we make sure that the bad ones are 
seen infrequently, uniformly over the whole space. With this remark the proof of the 
proposition is easily accomplished using the same nested K-R towers as before - but 
inside the uniform algebra. 

Finally the J-K theorem is established by constructing a refining sequence of uni- 
form partitions and looking at the inverse limit of the corresponding topological 
spaces. Notice that if Q refines 7, and both are uniform, then there is a natural 
homeomorphism from Xq onto Xy. The way in which the theorem is established also 
yields a proof of the relative J-K theorem. Theorem 19.11 □ 



MEASURABLE AND TOPOLOGICAL DYNAMICS 



39 



Using similar methods E. Lehrer jSI] shows that in the Jewett-Krieger theorem 
one can find, for any ergodic system, a strictly ergodic model which is topologically 
mixing. 

10. Models for other commutative diagrams 

One can describe Theorem 19.11 as asserting that every diagram of ergodic systems 
of the form X ^ Y has a strictly ergodic model. What can we say about more 
complicated commutative diagrams? A moments refiection will show that a repeated 
application of Theorem 19.11 proves the first assertion of the following theorem. 

10.1. Theorem. Any commutative diagram in the category of ergodic Z dynamical 
systems with the structure of an inverted tree, i. e. no portion of it looks like 

(7) Z 




X Y 

has a strictly ergodic model. On the other hand there exists a diagram of the form 
([7j) that does not admit a strictly ergodic model. 

For the proof of the second assertion we need the following theorem. 

10.2. Theorem. If (Z,r],T) is a strictly ergodic system and {Z,T) iX,T) and 

{Z,T) -H> {Y,T) are topological factors such that a~^{U) fl P~^{V) ^ whenever 
U G X and V G Y are nonempty open sets, then the measure-preserving systems 
X = (X, X, /i, T) and Y = (F, z/, T) are measure-theoretically disjoint. In particular 
this is the case if the systems {X, T) and {Y, T) are topologically disjoint. 

Proof. It suffices to show that the map ax(3: Z^XxY is onto since this will 
imply that the topological system (X x Y, T) is strictly ergodic. We establish this 
by showing that the measure A = (a x [3)^{rj) (a joining of /i and v) is full; i.e. that 
it assigns positive measure to every set of the form U x V with U and V as in the 
statement of the theorem. In fact, since by assumption rj is full we have 

X{U xV)= r]{{a x (3y\U x V)) = via'^U) f] r\V)) > 0. 

This completes the proof of the first assertion. The second follows since topological 
disjointness of (X, T) and (Y, T) implies that axj3:Z^XxY is onto. □ 

Proof of Theorem Theorem \lU.l\ We only need to prove the last assertion. Take X = 
Y to be any nontrivial weakly mixing system, then X x X is ergodic and the diagram 

(8) X X X 




X X 

is our counter example. In fact if (|7j) is a uniquely ergodic model in this situation 
then it is easy to establish that the condition in Theorem 110.21 is satisfied and we 
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apply this theorem to conclude that X is disjoint from itself. Since in a nontrivial 
system /i x and gr (/x, id) are different ergodic joinings, this contradiction proves our 
assertion. □ 



11. The Furstenberg- Weiss almost 1-1 extension theorem 

It is well known that in a topological measure space one can have sets that are 
large topologically but small in the sense of the measure. In topological dynamics 
when (X, T) is a factor of {Y, T) and the projection vr : F — > X is one to one on 
a topologically large set (i.e. the complement of a set of first category), one calls 
{Y,T) an almost 1-1 extension of (X, T) and considers the two systems to be very 
closely related. Nonetheless, in view of the opening sentence, it is possible that the 
measure theory of {Y, T) will be quite different from the measure theory of (X, T) . 
The following theorem realizes this possibility in an extreme way (see |28j). 

11.1. Theorem. Let (X, T) be a non-periodic minimal dynamical system, and let vr : 
Y —>■ X be an extension of (X, T) with {Y, T) topologically transitive and Y a compact 
metric space. Then there exists an almost 1-1 minimal extension, vf : (F, T) (X, T) 
and a Borel subset Yq (Z Y with a Borel measurable map 9 : Yq ^ Y satisfying (1) 
9T = TO, (2) W9 = TT, (3) 6 is 1-1 on Yq, (4) /^(^o) = 1 for any T-invariant measure 
fi on Y . 

In words, one can find an almost 1-1 minimal extension of X such that the measure 
theoretic structure is as rich as that of an arbitrary topologically transitive extension 
of X. 

An almost 1-1 extension of a minimal equicontinuous system is called an almost 
automorphic system. The next corollary demonstrates the usefulness of this mod- 
elling theorem. Other applications appeared e.g. in and |19| . 

11.2. Corollary. Let {X,X, fi,T) be an ergodic measure preserving transformation 
with infinite point spectrum defined by {G, p) where G is a compact monothetic group 
G = {p"}„g^. Then there is an almost 1-1 minimal extension of {G,p) (i.e. a 
minimal almost automorphic system), {Z,a) and an invariant measure v on Z such 
that (Z, 0", I/) is isomorphic to (X, X, /x, T). 



12. Cantor minimal representations 

A Cantor minimal dynamical system is a minimal topological system (X, T) where 
X is the Cantor set. Two Cantor minimal systems (X, T) and (F, S) are called orbit 
equivalent (OE) if there exists a homeomorphism F : X ^Y such that F{Qt{x)) = 
Os{Fx) for every a; G X. Equivalently: there are functions n : X — > Z and m : X — »• Z 
such that for every x G X F{Tx) = 5"(^)(Fa;) and F(T™(^)) = S{Fx). An old result 
of M. Boyle implies that the requirement that, say, the function n{x) be continuous 
already implies that the two systems are flip conjugate; i.e. {Y, S) is isomorphic either 
to (X, T) or to (X, T^^). However, if we require that both n{x) and m{x) have at 
most one point of discontinuity we get the new and, as it turns out, useful notion of 
strong orbit equivalence (SOE). A complete characterization of both OE and SOE of 
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Cantor minimal systems was obtained by Giordano Putnam and Skau jSO] in terms 
of an algebraic invariant of Cantor minimal systems called the dimension group. (See 
for a review of these results.) 
We conclude this section with the following remarkable theorems, due to N. Ormes 
|67j . which simultaneously generalize the theorems of Jewett and Krieger and a the- 
orem of Downarowicz \TE\ which, given any Choquet simplex Q, provides a Cantor 
minimal system {X,T) with Mjn{X) affinely homeomorphic with Q. (See also Dow- 
narowitcz and Serafin [20], and Boyle and Downarowicz [T3].) 

12.1. Theorem. 1. Let {Q, B, u, S) be an ergodic, non-atomic, probability measure 
preserving, dynamical system. Let {X, T) be a Cantor minimal system such that 
whenever exp{2TTi/p) is a (topological) eigenvalue of {X,T) for some p G M 
it is also a (measurable) eigenvalue of (f2, 3, i/, S*). Let fi be any element of 
the set of extreme points of Mt{X). Then, there exists a homeomorphism 
T' : X — > X such that (i) T and T' are strong orbit equivalent, (ii) (fi, 3, i/, 5") 
and (X, X, /i, T') are isomorphic as measure preserving dynamical systems. 

2. Let (n, !B, J/, S) be an ergodic, non-atomic, probability measure preserving, dy- 
namical system. Let (X, T) be a Cantor minimal system and fi any element 
of the set of extreme points of Mr(X). Then, there exists a homeomorphism 
T' : X —>■ X such that (i) T and T' are orbit equivalent, (ii) (fi, B, z/, 5) and 
(X, X, /i, T') are isomorphic as measure preserving dynamical systems. 

3. Let (f2,S,i/, S") be an ergodic, non-atomic, probability measure preserving dy- 
namical system. Let Q be any Choquet simplex and q an extreme point of Q. 
Then there exists a Cantor minimal system (X, T) and an affine homeomor- 
phism (f) : Q ^ Mt{X) such that, with fi = (f){q), (fi,S,i/, S") and {X,X, fi,T) 
are isomorphic as measure preserving dynamical systems. 



13. Other related theorems 
Let us mention a few more striking representation results. 

For the first one recall that a topological dynamical system (X, T) is said to be 
prime if it has no non-trivial factors. A similar definition can be given for measure 
preserving systems. There it is easy to see that a prime system (X, X, fi, T) must have 
zero entropy. It follows from a construction in [75j that the same holds for topological 
entropy, namely any system (X, T) with positive topological entropy has non-trivial 
factors. In [SI] it is shown that any ergodic zero entropy dynamical system has a 
minimal model (X, T) with the property that any pair of points {u, v) not on the 
same orbit has a dense orbit in X x X. Such minimal systems are necessarily prime, 
and thus we have the following result: 

13.1. Theorem. An ergodic dynamical system has a topological, minimal, prime 
model iff it has zero entropy. 

The second theorem, due to Glasner and Weiss [lOj, treats the positive entropy 
systems. 

13.2. Theorem. An ergodic dynamical system has a strictly ergodic, UPE model iff 
it has positive entropy. 
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We also have the following surprising result which is due to Weiss |HS] . 

13.3. Theorem. There exists a minimal metric dynamical system {X,T) with the 
property that for every ergodic probability measure preserving system (f2,S,yU, 5) 
there exists a T -invariant Borel probability measure v on X such that the systems 

B, /X, 5") and (X, X, z/, T) are isomorphic. 

In jM] E. Lindenstrauss proves the following: 

13.4. Theorem. Every ergodic measure distal Z-system X = (X, X, /i,T) can be 
represented as a minimal topologically distal system (X, T, /i) with fi G My^(X). 

This topological model need not, in general, be uniquely ergodic. In other words 
there are measure distal systems for which no uniquely ergodic topologically distal 
model exists. 

13.5. Proposition. 1. There exists an ergodic non-Kronecker measure distal sys- 
tem {Q, 5", m, T) with nontrivial maximal Kronecker factor {Qq, rriQ, T) such 
that (i) the extension {Q,3^,m,T) {QQ,3^Q,mQ,T) is finite to one a.e. and 
(a) every nontrivial factor map of {flo,9^o,mQ,T) is finite to one. 

2. A system {fl,3^,m,T) as in part 1 does not admit a topologically distal strictly 
ergodic model. 

Proof. 1. Irrational rotations of the circle as well as adding machines are examples 
of Kronecker systems satisfying condition (ii). There are several constructions in the 
literature of ergodic, non-Kronecker, measure distal, two point extensions of these 
Kronecker systems. A well known explicit example is the strictly ergodic Morse 
minimal system. 

2. Assume to the contrary that (X, /i, T) is a distal strictly ergodic model for 
(^2,5", m, T). Let {Z,T) be the maximal equicontinuous factor of (X, T) and let rj be 
the unique invariant probability measure on Z. Since by assumption (X, fi, T) is not 
Kronecker it follows that vr : X — > Z is not one to one. By Furstenberg's structure 
theorem for minimal distal systems {Z, T) is nontrivial and moreover there exists an 
intermediate extension X ^ Y Z such that a is an isometric extension. A well 
known construction implies the existence of a minimal group extension p : {Y, T) — *■ 
{Z,T), with compact fiber group K, such that the following diagram is commutative 
(see Sectional above). We denote by u the unique invariant measure on Y (the image 
of /i) and let 9 be an ergodic measure on Y which projects onto u. The dotted arrows 
denote measure theoretic factor maps. 



(X,/i) 
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Next form the measure = j]^ RkudrriK, where rriK is Haar measure on K and for 



each k & K, Rk denotes right translation by A; on y (an automorphism of the system 
(y,T)). We still have (f){d) = u. 



A well known theorem in topological dynamics (see j71|) implies that a minimal 
distal finite to one extension of a minimal equicontinuous system is again equicon- 
tinuous and since {Z, T) is the maximal equicontinuous factor of (X, T) we conclude 
that the extension a : F — > Z is not finite to one. Now the fibers of the extension a 
are homeomorphic to a homogeneous space K/H, where if is a closed subgroup of 
K. Considering the measure disintegration 9 = J^6zdr]{z) of 6 over rj and its pro- 
jection z/ = f^i^z dri{z), the disintegration of z/ over r], we see that a.e. 9^ = rriK and 

= ^K/H- Since K/H is infinite we conclude that the measure theoretical extension 
a : (y, v) — * (Z, T]) is not finite to one. However considering the dotted part of the 
diagram we arrive at the opposite conclusion. This conflict concludes the proof of the 
proposition. □ 

In |HH] Ornstein and Weiss introduced the notion of tightness for measure preserving 
systems and the analogous notion of mean distality for topological systems. 

13.6. Definition. Let (X, T) be a topological system. 

1. A pair (x,?/) in X x X is mean proximal if for some (hence any) compatible 



If this lim sup is positive the pair is called mean distal. 

2. The system (X, T) is mean distal if every pair with x ^ y ys, mean distal. 

3. Given a T-invariant probability measure /i on X, the triple {X,n,T) is called 
tight if there is a /i-conuU set Xq C X such that every pair of distinct points 
(x, y) in Xo x Xq is mean distal. 

Ornstein and Weiss show that tightness is in fact a property of the measure pre- 
serving system (X, /i, T) (i.e. if the measure system (X, X, /i, T) admits one tight 
model then every topological model is tight). They obtain the following results. 

13.7. Theorem. 

1. If the entropy of {X, fi,T) is positive and finite then {X,fi,T) is not tight. 

2. There exist strictly ergodic non-tight systems with zero entropy. 

Surprisingly the proof in [HH] of the non-tightness of a positive entropy system does 
not work in the case when the entropy is infinite which is still open. 

J. King gave an example of a tight system with a non-tight factor. Following this 
he and Weiss jHE] established the following result. Note that this theorem implies 
that tightness and mean distality are not preserved by factors. 



metric d 




13.8. Theorem. If {X,X, iJ,,T) is ergodic with zero entropy then there exists a mean- 
distal system {Y, u, S) which admits (X, X, n, T) as a factor. 
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